Draft version February 27, 2013 

Preprint typeset using I^'T^]X style eniulatoapj v. 5/2/11 



(N 

o 



l> 



o 
u 

o 



> 

a^ 

00 

rn 

rn 

O 



1 



THE DEEP2 GALAXY REDSHIFT SURVEY: 
THE VORONOTDELAUNAY METHOD CATALOG OF GALAXY GROUPS 

Brian F. Gerke^-^, Jeffrey A. Newman'*, Marc Davis^, Alison L. Coil^, Michael C. Cooper'', Aaron A. Button'', 
S. M. Faber**, Puragra Guhathakurta**, Nicholas Konidaris'\ David C. Koo**, Lihwai Lin^°, Kai Noeske", 
Andrew C. Phillips*^, David J. Rosario^^, Benjamin J. Weiner^"', Christopher N. A. Willmer^^, Renbin Yan" 

Draft version February 27, 2013 

ABSTRACT 

We present a public catalog of galaxy groups constructed from the spectroscopic sample of galaxies 
in the fourth data release from the DEEP2 Galaxy Redshift Survey, including the Extended Groth 
Strip (EGS). The catalog contains 1165 groups with two or more members in the EGS over the redshift 
range < z < 1.5 and 1295 groups at z > 0.6 in the rest of DEEP2. 25% of EGS galaxies and 14% of 
high-z DEEP2 galaxies are assigned to galaxy groups. The groups were detected using the Voronoi- 
Delaunay Method, aft er it has been opti mized on mock DEEP2 catalogs following similar methods 
to those employed in iGerke et al.l ()2005| ). In the optimization effort, we have taken particular care 
to ensure that the mock catalogs resemble the data as closely as possible, and we have fine-tuned 
our methods separately on mocks constructed for the EGS and the rest of DEEP2. We have also 
probed the effect of the assumed cosmology on our inferred group-finding efficiency by performing 
our optimization on three different mock catalogs with different background cosmologies, finding 
large differences in the group-finding success we can achieve for these different mocks. Using the 
mock catalog whose background cosmology is most consistent with current data, we estimate that 
the DEEP2 group catalog is 72% complete and 61% pure (74% and 67% for the EGS) and that the 
group-finder correctly classifies 70% of galaxies that truly belong to groups, with an additional 46% of 
interloper galaxies contaminating the catalog (66% and 43% for the EGS). We also confirm that the 
VDM catalog reconstructs the abundance of galaxy groups with velocity dispersions above ~ 300km 
s^^, to an accuracy better than the sample variance, and that this successful reconstruction is not 
strongly dependent on cosmology. This makes the DEEP2 group catalog a promising probe of the 
growth of cosmic structure that can potentially be used for cosmological tests. 
Subject headings: Galaxies: high-redshift — galaxies: clusters: general 



I. INTRODUCTION 



The spherical or ellipsoidal gravitational collapse of an 
overdense region of space in an expanding background 
is a simple dynamical problem that can be used as an 
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Ansatz to predict the mass distribution of massive col- 
lapsed structures in the Cold Dark Matter cosmolog- 
ical para digm, as a function of t he cosmological pa- 
rameters (iPress fc S chechtej 119741: iBardeen etall 119861: 
iSheth fc TormenI 12002). This has led to the widespread 
use galaxy clusters and groups as convenient cosmo- 
logical probes. In addition, it has long been appar- 
ent that the galaxy population in groups and clusters 
differs in its pr operties from the gener a l populat ion of 
galaxies (e.g., iSpitzer fc Baadd Il951l : iPiressleH I1980D 
and that the two populations exhibit different evolution 
(Butcher & Ocmler 1984). This suggests that galaxy 
groups and clusters can be used as laboratories for study- 
ing evolutionary processes in galaxies. For both of these 
reasons, a catalog of groups and clusters has been derived 
for every large survey of galaxies. 

The history of group and cluster finding in galaxy 
surveys includes a wide variety of detection methods, 
starting with th e visu a l dete ction of local clusters in 
imaging data by lAbelll (|1958f ) . The approaches can be 
broadly divided into two categories: those that use pho- 
tometric data only, and those that use spectroscopic 
redshift information. In relatively shallow photomet- 
ric data, it is possible to find clusters by simply look- 
ing for overdensities in the on-sky galaxy distribution, 
but in modern, deep photometric surveys, foreground 
and background objects quickly overwhelm these density 
peaks at all but the lowest redshifts. Recent photomet- 
ric cluster-finding algorithms thus typically also rely on 



assumptions about the properties of galaxies in clusters, 
on photometric re dshift estimates, or on a combination 
of the tw o fe.g.. iPostman e t"al''1996': 'Gladdcrs & Yed 
2000; Kocs ter et al.ll2007tlLi fc Yee 2008; Liu ct al. 2008.; 
Adami ct al. 2010; Hao et^ I20ia IMilkeraitis et alJ 
2010; Soarcs-Santos et a l. 201l(). 



Spectroscopic galaxy redshift surveys remove much of 
the problem of projection effects from cluster-finding ef- 
forts, though not all of it, owing to the well-known fingcr- 
of-god effect. Since closely neighboring galaxies in red- 
shift space can be assumed to be physically associated, it 
is possible to use spectroscopic surveys to reliably detect 
relatively low- mass, galaxy-poor systems (i.e., galaxy 
groups), in addition to rich, massive clusters. The most 
popular approach historically has been the friends-of- 
friends, or percolation, algorithm, which links galaxies 
together with their neighbors that lie within a given link- 
ing length on the sky and in redshift space, without refer- 
ence to galaxy properties. T his technique was pion eered 
in the CfA redshift survey ()Huchra fc Gelleiill982[ ) and 
is stil l in common use in present-day redshift surveys 
(e.g lEke et al.|[200l IBerlind eTallllool fKnobel et al] 
|2009( ). Recently, other redshift-space algorithms have 
also had success by including simple assumptions about 
the properties of galaxies in clust ers and groups (e.g., 
iMiUer et al.|[2005HYang eral|[2005l ). The primary disad- 
vantage of cluster-finding in redshift-space data is that 
spectroscopic surveys generally cannot schedule every 
galaxy for observation, leading to a sparser sampling of 
the galaxy population than is available in photometric 
data. When the sampling rate becomes extremely low, 
standard methods like friends-of-friends have a very high 
failure rate. This is a particular concern for high-redshift 
surveys, for which spectroscopy is very observationally 
expensive. 

In any case, since cluster-finding algorithms search 
for spatial associations in a pointlike dataset, it can be 
shown that a perfect reconstruction of the true, underly- 
ing bound systems can never be achieved owing to ran- 
dom noise (Szapudi & Szalay 1996). Indeed, it has long 
been known that a fundamental trade-off exists between 
the purity and completeness of a cluster catalog when 
compared with the underl ying dark-matter halo pop - 
ulation in N-body models (jNolthenius fc White! ll987D : 
a catalog cannot be constructed that detects all exist- 
ing clusters and is free of false detections. In order to 
fully understand and minimize these inevitable errors, it 
has become standard practice to use mock galaxy cata- 
logs, based on TV-body dark-matter simulations, to test 
cluster-finding algorithms, optimize their free parame- 
ters, and estimate the level of error in the final catalog. 
In all such studies, some effort has been made to en- 
sure that the mock catalogs resemble the data at least 
in a qualitative sense, but little work has been done to 
examine how quantitative differences between the mocks 
and the data, or inaccuracies in the assumed background 
cosmology, will impact the group-finder calibration. 

In this paper, we present a catalog of galaxy groups and 
clusters for the final da ta release (DR4) of t he DEEP2 
Galaxy Redshift Survey ([Newman et al.l[2012[ ). a spectro- 
scopic survey of tens of thousands of mostly high-redshift 
galaxies, with a median redshift around z = 0.9. The cat- 
alog is made available to the public on the DEEP2 DR4 



webpag43- To construct this catalog, we make use of 
the the Voronoi-Delaunay Metho d (VDM) group-finder , 
which was originally developed bv lMarinoni et all (|2002[ ) 
for use in relatively sparsely sampled, high-redshift sur- 
veys similar to DEEP2. To test and calibrate our meth- 
ods, we make use of a set of realistic mock galaxy cata- 
logs that we have recently constructed for DEEP2 (Gerke 
et al. in preparation). These catalogs have been con- 
structed for several different background cosmologies, al- 
lowing us to test the impact of cosmology on the group- 
finder calibration and error rate. This work updates and 
expan ds upon the group-finding efforts of iGerke et al.l 
(|2005D (hereafter G05), who detected groups with the 
VDM algorithm in early DEEP2 data using an earlier 
set of DEEP2 mocks for calibration. 

Our goals in constructing this catalog are similar to 
the historical ones described above. First, a catalog of 
galaxy groups is an interesting tool for studying the evo- 
lution of the galaxy population in DEEP2, as well as for 
studying the baryonic astrophysics of groups and clusters 
themselves, as has bee n demonstrated in various papers 
using the G 05 catalog (Fang et al."2006"; 'Coil et aL"200 ' 



Gerke et al . 2007; Gcorgakakis ct al. 2008; Jcltcma et al 
20091 ). In addition, it has been shown that a catalog of 



groups from a survey like DEEP2 can be used to probe 
cosmological parameters, including the equation of state 
of the dark energy, by counting groups as a function 
of th eir redshift and velocity dispersion ([Newman et al.l 
l2002f l: we aim to produce a group catalog suitable for 
that purpose here. 

We proceed as follows. In Section [5] we introduce the 
DEEP2 dataset and describe our methods for construct- 
ing realistic DEEP2 mock catalogs with which to test 
and refine our group-finding methods. Section [3] details 
the specific criteria we use for such testing. In Section [4] 
we give a brief overview of VDM, including some changes 
to the G05 algorithm, and we optimize the algorithm on 
our mock catalogs in Section [5] The latter section also 
explores the dependence of our optimum group-finding 
parameters on the assumed cosmology of the mock cat- 
alogs. Section [6] presents the DEEP2 group catalog and 
compares it to other high-redshift spectroscopic group 
catalogs. Throughout this paper, where necessary and 
not otherwise specified, we assume a fiat ACDM cosmol- 
ogy with r^M = 0.3 and h = 0.7. 

2. THE DEEP2 SURVEY AND MOCK CATALOGS 

2.1. The DEEP2 dataset 

The DEEP2 (Deep Extragalactic Evolutionary Probe 
2) Galaxy Redshift Survey is the largest spectroscopic 
survey of homogeneously selected galaxies at redshifts 
near unity. It consists of some 50,000 spectra obtained 
in one-hour expos ures with the DEIMOS spectrograph 
([Faber et al.ll2003D on the Keck II telescope. This dataset 
yielded more than 35,000 confirmed galaxy redshifts; the 
rest were either stellar spectra or failed to yield a reli- 
able redshift ident ification. DEEP2 wil l be comprehen- 
sively described in lNewman et al . (2012); most d etails of 
the survey c an als o be f ound in Willmer et al.l (f2006.) . 
iDavis et al.l ([2OO40 . and iDavis et al.l ([2007[ ). Here we 
summarize the main survey characteristics, focusing on 
issues of particular importance for group finding. 

^^ [iittp : //deep . berkeley ■ edu/drj] 



DEEP2 comprises four separate observing fields, cho- 
sen to lie in regions of low Galactic dust extinction that 
are also widely separated in RA to allow for year-round 
observing. With a combined area of approximately three 
square degrees, the DEEP2 fields probe a volume of 
5.6 X 10^ h"^ Mpc^ over the primary DEEP2 redshift 
range 0.75 < z < 1.4. This is an excellent survey volume 
for studying galaxy groups: at the relevant epochs, one 
expects to find more than one thousand dark matter ha- 
los with masses in the range of galaxy groups (roughly 
5 X 10'^'^Mq < Mhaio < 1 X 10'^^Mq) in a volume of 
this size. DEEP2 is less well suited for studying clus- 
ters: at most there should be a few to a few tens of halos 
with cluster masses {Mhaio > 1 x lO^Mg) in the DEEP2 
fields. Since our final catalog will be dominated by ob- 
jects that are traditionally referred to as groups (rather 
than clusters) we will use that term throughout this work 
as a shorthand to refer to both groups and clusters. 

DEEP2 spectroscopic observations were carried out us- 
ing the 1200-line diffraction grating on DEIMOS, giving a 
spectral resolution of i? ^ 6000. This yields a velocity ac- 
curacy of ^ 30 km/s (measured from repeat observations 
of a subset of targets). Such high-precision velocity mea- 
surements make DEEP2 an excellent survey for detect- 
ing galaxy groups and clusters in redshift space, which 
is our strategy here. The velocity errors are substan- 
tially smaller than typical galaxy peculiar velocities in 
groups, so the dominant complication for redshift-space 
group- finding will be the finger-of-god effect, rather than 
redshift-measurement error. 

Targets for DEIMOS spectroscopy were selected down 
to a limiting magnitude of i? = 24.1 from three- 
band {BRI) photometric observations taken with the 
CFH12k imager o n the Canada-France-Hawaii Telescope 
(|Coil et al.ll2004al) . To focus the survey on typical galax- 
ies at 2; ~ 1 (rather than low-z dwarfs) most DEEP2 
targets were also restrictied to a region oi B — R ver- 
sus R — I color-color space that was chosen to con- 
tain a nea rly complete sample of galaxies at z > 0.75 
(| Davis et a l. 2004) . Tests with spectroscopic samples ob- 
served with no color pre-selection show that the DEEP2 
color cuts exclude the bulk of low-redshift targets, while 
still incl uding ^ 97% of gala xies in the range 0.75 < 
z < 1.4 (iNewman et~aI1 1201 2[ ) . (At z > 1.4— in the so- 
called "redshift desert" — it is difhcult to obtain success- 
ful galaxy redshifts because of a lack of spectral features 
in the observed optical waveband.) 

Despite the high completeness of the DEEP2 color se- 
lection at high redshift, there remain a number of ob- 
servational effects that reduce the sampling density of 
galaxies in groups and clusters. The simplest is the faint 
apparent magnitude range of z ~ 1 gal axies. DEEP 2 
is lim ited to luminous galaxies (L > L*; iWillmer et al.l 
l2006f ) at most redshifts of interest; even massive clus- 
ters will contain a few tens of such galaxies at most. At 
redshifts near z = 0.9, DEEP2 has a number density of 
galaxies n ~ 0.01 (Newman et al. in preparation), cor- 
responding to a fairly sparse galaxy sample with mean 
intergalaxy separation s ~ 5h~^Mpc (comoving units). 
The DEEP2 group sample will thus be made up of sys- 
tems with relatively low richnesses. 

A further complication arises from the effects of k- 
corrections on high-rcdshift galaxies, which translate 



the DEEP2 i?-band apparent magnitude limit into a 
an evolving, color-dependent luminosity cut in the rest 
fra me of DEEP 2 galaxies. As discuss e d in detail 
in IWillmer et all (|2006l ) and iGerke et al.l (pOOTi ). red- 
sequence galaxies in DEEP2 will have a brighter absolute 
magnitude limit than blue galaxies at the same redshift, 
and this disparity increases rapidly with redshift as the 
observed R band shifts through the rest-frame B ban d 
and into the U band (c/. Figure 2 of IGerke et aIll2007D . 
Galaxies on the red sequence are well known to pref- 
erentially inhabit the overdense environments of groups 
and clusters, and this relation holds a t z ~ 1 in DEEP2 
(jCooper et al.ll2007l : [Gerke et al.ll2007D . This means that 
groups and clusters of galaxies in DEEP2 will have a 
lower sampling density than the overall galaxy popula- 
tion, and the observed galaxy population in groups will 
be skewed toward more luminous objects. 

Further undersampling of DEEP2 group galaxies re- 
sults from the unavoidable realities of multiplexed spec- 
troscopy. DEEP2 spectroscopic targets were observed 
using custom-designed DEIMOS slitmasks that allowed 
for simultaneous observations of more than 100 targets. 
Although slits on DEEP2 masks could be made as short 
as 3", and some slits could be designed to observe two 
neighboring galaxies at once, the requirement that slits 
not overlap with one another along the spectral direc- 
tion of a mask inevitably limits the on-sky density of 
targets that can be observed. Overall, DEEP2 observed 
roughly 65% of potential targets, but this fraction is nec- 
essarily lower in crowded regions on the sky owing to slit 
conflicts. The adaptive DEEP2 slitmask-tiling strategy 
relieves crowding issues somewhat, since each target has 
two chances for selection on overlapping slitmasks, but 
there is still a distinct anticorrelation between targeting 
rate and target density: the sampling rate for targets in 
the most crowded regions on th e sky is roughly 70% of 
the median sampling rate fG05. [Newman et al.ll201^ . 

Nevertheless, as discussed in G05, the significant line- 
of-sight distance covered by DEEP2 means that high- 
density regions on the sky do not necessarily correspond 
to high-density regions in three-space. The impact of slit 
conflicts on the sampling of groups and clusters should 
therefore be lower than the effect seen in crowded re- 
gions of sky. We can test this explicitly using simu- 
lated galaxies in the mock catalogs described in Sec- 
tion 12.21 Galaxies in the mocks are selected using the 
same slitmask-making algorithm as used for DEEP2, and 
since the mocks also contain information on dark-matter 
halo masses, it is possible to investigate the effect of 
this algorithm on the sampling rate in group-mass and 
cluster-mass halos. As shown in Figure [Tl galaxies in 
massive halos are undersampled relative to field galaxies, 
but the effect is modest, amounting to less than a 10% 
reduction in sampling rate at group masses and only a 
^ 20% reduction for the most massive clusters in the 
mocks. 

Redshift failure is a final factor that impacts the sam- 
pling rate of groups and clusters. After visual inspection, 
roughly 30% of DEEP2 spectra fail to yield a reliable red- 
shift (i.e., do not receive DEEP2 redshift quality flag 3 
or 4, which correspond to 95% and 99% confidence in the 
redshift identification, respectively). These redshift fail- 
ures are excluded from all samples used for group-finding. 
Follow-up observations (C. Steidel, private communica- 
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Figure 1. Relative spectroscopic targeting and redsiiift-success 
rates of mock DEEP2 galaxies versus parent halo mass. Because 
of increased slit conflicts in crowded regions, galaxies in groups and 
clusters {Mf^alo > lO^^M©) are targeted for DEIMOS spectroscopy 
at a lower rate than galaxies like the Milky Way {Mf^alo ~ 10^^ Mq) 
(solid line). The eiTect is mild, however, and remains less than 
20% for all but the most massive clusters in the mock catalog. The 
sampling rate also falls for low-mass halos, since these contain faint 
galaxies, some fraction of which fall below the DEEP2 magnitude 
limit. Faint red galaxies are preferentially undersampled further 
because they are less likely to yield reliable redshifts; however, this 
effect is also mild and in any caes is limited mostly to galaxies in 
low-mass halos (dashed line). 

tion) show that roughly half of these redshift failues lie 
at z > 1.4, but the remainder serve to further reduce 
the DEEP2 sampling rate in the target redshift range. 
The redshift failure rate increases sharply for galaxies in 
the faintest half magnitude of the sample, and it is also 
boosted for red galaxies, since these tend to lack strong 
emission lines, making redshift identification more dif- 
ficult. One might expect that this would decrease the 
sampling rate preferentially in groups and clusters, which 
should have a large number of faint red satellite galax- 
ies. It is also possible to test this with the mock catalogs 
(which account for the color and magnitude-dependence 
of the incompleteness, as discussed in the next section). 
As shown in Figure [1] (dashed curve), redshift failures 
have a stronger effect on mock galaxies in low-mass ha- 
los (since they preferentially host faint galaxies) than in 
groups and clusters so that, if anything, the relative sam- 
pling rate of groups and clusters is boosted slightly by 
redshift failures. 

In any case, Figure [1] demonstrates the importance 
of having realistic mock catalogs on which to calibrate 
group-finding methods. Without accurate modeling of 
the selection probability for galaxies in massive halos 
relative to field galaxies, it will be difficult to have con- 
fidence in measures of group- finding success (e.g., the 
completeness and purity of the group catalog). In the 
next section, we will describe the mock catalogs we use 
to test our group-finding methods and optimize them for 
the DEEP2 catalog, focusing on the steps that have been 
taken to account for all of the different DEEP2 selection 



effects discussed above. 

2.1.1. The Extended Groth Strip 

Before we proceed, it is important to describe the 
somewhat different selection criteria that were used in 
one particular DEEP2 field, the Extended Groth Strip 
(EGS). This field is also the site of AEGIS, a large com- 
pendium of datasets spanning a broad range of wave- 
lengths, from X-ray to radio ([Davis et al.l I2007J) . To 
maximize the redshift coverage of these multiwavelength 
datasets, DEEP2 targets were selected without color 
cuts, so that galaxy spectra are obtained across the full 
redshift range < ^ < 1.4. However, spectroscopic tar- 
get selection used a probabilistic weighting as a func- 
tion of color, to ensure roughly equal numbers of targets 
at low and high redshift; this means that the sampling 
rate of galaxies will vary differently with redshift than 
would be expected in a simple magnitude-limited sam- 
ple. Furthermore, the EGS was observed with a different 
spectroscopic targeting strategy, so that each galaxy has 
four chances to be observed on different overlapping slit- 
masks. The overall sampling rate in the EGS is thus 
boosted somewhat relative to the rest of DEEP2. These 
differences in selection mean that it will be important to 
calibrate our group-finding techniques separately for the 
EGS and the rest of the DEEP2 sample. Our mock cat- 
alogs will therefore need to be flexible enough to account 
for the differences in selection between the EGS and the 
rest of DEEP2. 

2.2. DEEP2 mock catalogs 

The success of any group-finder will depend sensitively 
on the selection function of galaxies in halos of differ- 
ent masses, since this drives the observed overdensity of 
groups and clusters relative to the background of field 
galaxies. It is therefore crucial to test and optimize 
group-finding algorithms on simulated galaxy catalogs 
that capture and characterize this mass-dependent se- 
lection function as accurately as possible. It is help- 
ful to couch this discussion in the terminology of the 
halo model (e.g ., Peacoc k fc SmithI ,2000: Seljak JOOiJ 
iMa fc Fr"vll200ClD . particularly the halo occupation dis- 
tribution (HOD) N{M), which is the average number 
of galaxies meeting some criterion (usually a luminosity 
threshold) in a halo of mass M. What we would like 
is a mock catalog that correctly reproduces the HOD of 
observed galaxies, not just the HOD for galaxies above 
some luminosity cut, in group-mass halos. As discussed 
below, this will require us to improve upon the mocks we 
used for the initial DEEP2 group-finding calibration in 
G05. 

In that study, we optim ized the VDM g roup-finder 
using the mock catalogs of lYan et al.l ()2004l ) (hereafter 
YWG). Those authors produced mock DEEP2 cata- 
logs from a large-volume N-body simulation by adding 
galaxies to dark-matter halos according to a conditional 
luminosity function $(L|Af) whose form an d parame- 
ters we re chosen to be consistent with the iCoil et aD 
(j20041i ) galaxy autocorrelation function measured in 
early DEEP2 data. Since the HOD is directly linked 
to the correlation fimction, this implied that the HOD 
in the mocks was consistent with existing data. How- 
ever, the agreement between the high-redshift mock and 



measured correlatio n functions was m arginal at best, and 
later measurements (|Coil et al.ll2OO60 narrowed the error 
bars on the DEEP 2 correlation function so that the exist- 
ing mocks no longer agree with the data at high redshift. 
Indeed, direct modeling of the HOD from the DEEP2 
correlation function (Zheng et al. 200 7) is c^uite inconsis- 
tent with the HOD that was used in YWC. In particular, 
the YWC HOD had a power-law index of ~ 0.7 at high 
masses, while the HOD derived from DEEP2 data has 
a power-law index near unity. This suggests that the 
galaxy occupation of groups in the YWC mocks is quite 
different from that in the real universe. 

Another difficulty arises when we consider color- 
dependent selection effects. As discussed above, the 
DEEP2 magnitude limit translates into an evolving, 
color-dependent luminosity cut that also evolves with 
redshift, which may lead to preferential undersampling of 
groups and clusters. This is further complicated by the 
fact that the color-dens ity relation also evolves over the 
DEE P2 redshift range (jGerke et al.l 120071: iCooper et all 
|2007|) . Correct modeling of galaxy colors in the mock 
catalogs is therefore critical to proper calibration of our 
cluster-finding efforts. Unfortunately, the YWC mocks 
did not contain any color information, so any preferential 
color-dependent undersar npling of groups a nd clusters 
was not reflected there. iGerke et al.l ()2007[ ) addressed 
this problem by adding colors to the YWC mocks ac- 
cordi ng to the measure d DEEP2 color-density relation 
from ICooper et all (2006), but this did not address the 
inaccuracy of the underlying HOD. 

A final possible problem involves the choice of cosmo- 
logical background model used to construct the mock 
catalogs. The YWC mocks we used in G05 used N-body 
simulations calculated in a flat, ACDM cosmology with 
parameters ^Im — 0.3 and as — 0.9, both of which lie out- 
side the region of parameter space preferred by current 
data. Because changing these parameters has a signifi- 
cant impact on the halo abundance at 2; ~ 1, and because 
any realistic mock catalog will be constrained to match 
the abundance of galaxies, changes in the cosmology will 
necessarily have a substantial impact on the HOD. For 
example, a model with a higher (lower) cg will have a 
higher (lower) abundance of halos at any given mass, 
and thus will require a lower (higher) N{M) to match 
the observed galaxy abundance. This effect will be dis- 
cussed in more depth in the paper describing the new 
DEEP2 mocks (Gerke et al. in prep.), but here it will be 
important to assess its impact on group finding. 

Thus, as pointed out in YWC, it is important to update 
the mock catalogs to match DEEP2 more closely, now 
that a larger dataset is available. In this paper we make 
use of a new set of DEEP2 mock catalogs that remedy 
many of the inadequacies of the previous mocks. These 
mocks will be described in detail in a paper by Gerke 
et al. (in preparation); here we summarize the most 
important improvements over YWC for the purposes of 
group-finding calibration. 

The new mocks are produced from N-body simulations 
that have sufficient mass resolution to detect dark-matter 
halos and subhalos down to the mass range of dwarf 
galaxies with absolute magnitudes ~ M* + 10. This per- 
mits us to assign galaxies uniquely to dark-matter halos 
and subhalos over the full range of redshift and lumi- 



Table 1 

Summary of the simulations used to construct DEEP2 
mock catalogs. 



Simulation Box size '^ Fields 



n 



M 



0-8 



h 



Bolshoi 
L160 ART 
LI 20 ART 



250 
160 
120 



40 

12 
12 



0.27 0.82 0.7 
0.24 0.7 0.7 
0.3 0.9 0.73 



^ comoving h Mpc on a side. 

^ Number of mock 1 deg^ DEEP2 fields or 0.5 deg^ 

fields produced from each simulation. 

nosity covered by DEEP2, including the EGS. In order 
to investigate the impact of different cosmological mod- 
els on group finding, we have constructed mock cata- 
logs using three different simulations with three different 
background cosmologies that span the current range of 
allowed models; these are summarized in Table [TJ We 
use the mocks cons tructed from the Bolshoi simulation 
(jKlvpin et al.ll2010l ) as our fiducial model for quoting our 
main results, since its parameters are most consistent 
with current data, but we will use the other two cosmo- 
logical models to investigate the impact on our results 
of changes in the cosmological background. As discussed 
in Gerke et al. (in prep.), we construct light cones from 
these simulations, each having the geometry of a sin- 
gle DEEP2 observational field. To properly account for 
cosmic evolution, we stack different simulation timesteps 
along the line of sight, and we limit the number of light- 
cones we create for each simulation to ensure that the 
resulting mocks sample roughly independent volumes at 
fixed redshift. 

To add mock galaxies to these dark-matter-only light- 
cones, we use the so-called subhalo ab u ndance-matching 
appro ach (e.g., iConrov et al.l l2006t iVale fc Ostrikeri 
l2006f) to assign galaxy luminosities to dark-matter sub- 
halos identified in the simulations. Using the measured 
DEEP2 galaxy luminosity function (including its red- 
shift evolution) and simulated subhalo internal velocity- 
dispersion function, we map galaxy luminosities to sub- 
halos at fixed number density. By contrast, the dark- 
matter simulations used for the YWC mocks did not in- 
clude detections of dark-matter substructures to a suf- 
ficiently low mass, so galaxies were assigned to dark- 
matter halos stochastically from an HOD, with satellite 
galaxies assigned to randomly selected dark matter parti- 
cles. Our subhalo-based procedure should give a more ac- 
curate representation of the luminous p rofiles and galaxy 
kinematics of galaxy clusters than the lYan et all (|2004[ ) 
mocks. In addtion, the simulations used for the ear- 
lier mocks resolved halo masses sufficient to host central 
galaxies only down to ~ O.IL*. This made it impossible 
to create realistic mock catalogs for the EGS field, since 
this region includes faint dwarf galaxies at low redshits. 
Our new mocks resolve halos and subhalos to masses low 
enough to accommodate all DEEP2 galaxies except for 
a handful of ver y faint dwarfs at z < 0.05. 

IConrov et al.l (|2006( ) showed that the abundance- 
matching procedure reproduces the galaxy autocorrela- 
tion function at a wide range of redshifts, provided that 
the subhalo velocity function uses the subhalo velocities 
as measured at the moment they were accreted into larger 
halos. and for a particular choice of cosmological param- 
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eters that is now disfavored by the data. As discussed 
in Gerke et al (in prep.), however, for the more accu- 
rate cosmology used in Bolshoi, the abundance-matching 
approach does not reproduce the DEEP2 projected two- 
point function at z '--^ 1, lying some 20-40 % higher than 
the measurement from (jCoil et al.l 120061 ) . As we also 
discuss in that paper, the likely resolution to this dis- 
crepancy would involve an abundance-matching appoach 
that includes scatter in luminosity at fixed subhalo ve- 
locity dispersion, with larger scatter at lower dispersion 
values. This is likely to mainly impact the HOD at low 
masses, near the transition of N{M) between zero and 
unity, while causing minimal alteration in the HOD at 
group and cluster mas ses. Since the Bolsh oi mock HOD 
matches the measured iZheng et al.l (|2007D HOD well at 
these masses, we concluded that the clustering mismatch 
does not preclude using these mocks for group-finder op- 
timization. The overall occupation of group-mass ha- 
los in the mocks should represent the real universe well. 
What then remains is to account for the various obser- 
vational selection effects that translate this into an ob- 
served HOD for groups. 

To add galaxy colors to the mocks, we have followed an 
approach similar to the one used in IGerke et al.l (|2007D 
(which was itself inspired by the ADDGALS algorithm; 
Wechsler et al. in prep.). We assign a rest-frame U — B 
color to each mock galaxy by drawing a DEEP2 galaxy 
with similar redshift, luminosity, and local galaxy over- 
density While performing the color assignment, we must 
also account for galaxies that fall below the DEEP2 ap- 
parent magnitude limit. At fixed redshift redshift, there 
is some luminosity range in which the DEEP2 sample is 
partially incomplete, depending on galaxy color. In these 
luminosity ranges, we select galaxies for exclusion from 
the mock catalog depending on their local density, until 
the local density distribution in the mock is consistent 
with the measured distribution in DEEP2. This tech- 
nique effectively uses local galaxy density as a proxy for 
color and ensures that the impact of the DEEP 2 selec- 
tion function on the sampling of galaxy environment is 
accurately reproduced in the mocks. Full details of the 
color-assignment algorithm (which are somewhat com- 
plex and beyond the scope of this discussion) can be 
found in the paper describing the mock catalogs (Gerke 
et al. in preparation). 

After assigning rest-frame colors, we then assign ob- 
served apparent R-han d magnitudes by inve rting the k- 
correction procedure of lWillmer et al.l (|2006l ): this proce- 
dure accurately reproduces the evolving, color-dependent 
luminosity cut that is imposed by the DEEP 2 magnitude 
limit, as well as the color-density relation, so any under- 
sampling of groups and clusters owing to color-dependent 
selection effects should also be captured in these mocks. 

As we did in G05, to simulate the effects of DEEP2 
spectroscopic target selection we pass our mock cata- 
logs through the same slitmask-making algorithm that 
was used to schedule objects for DEEP2 observations 
(jDavis et al.l 120041 : iNewman etH.! 120121) . The DEEP2 
color cuts do not give a completely pure sample of 
high-redshift galaxies, so the pool of mock targets for 
maskmaking also includes foreground {z < 0.75) and 
background {z > 1.4) galaxies, as well as randomly posi- 
tioned stars, in proportions that are consistent with those 
found in the DEEP2 sample. To make mocks of the EGS 



field, we use the somewhat different target-selection al- 
gorithm that was used for the EGS, including galaxies 
at all redshifts, but giving higher selection probability 
to galaxies at z > 0.75 in a manner that reflects the 
color-dependent weighting applied to the real EGS. Any 
density-dependent effects on the sampling rate that are 
driven by slit conflicts should therefore be fully accounted 
for in the mocks. 

As a final step, we must replicate the effects of DEEP2 
redshift failures, as a function of galaxy color and magni- 
tude. To do this, we utilize t he incompleteness-cor rection 
weighting scheme devised bv lWillmer et al.l (|2006[ ). This 
scheme assigns a weight to each galaxy according to 
the fraction of similar galaxies (in observed color-color- 
magnitude space) that failed to yield a redshift. When we 
add colors to the mock galaxies by selecting galaxies from 
the DEEP2 sample, we also assign each mock galaxy the 
incompleteness weight Wi of the DEEP 2 galaxy we have 
drawn (with some small corrections, described in Gerke 
et al. in preparaion). Although this was intended to 
correct for redshift incompleteness in the data, it can be 
inverted to produce incompleteness in the mock: after we 
have selected targets with the DEEP2 slitmask-making 
algorithm, we reject ~ 30% of these targets, with a re- 
jection probability given by 1/wi. This procedure natu- 
rally reproduces any dependence of the DEEP2 redshift- 
success rate on galaxy color and magnitude. 

These mock catalogs accurately reproduce a wide range 
of statistical properties of the DEEP2 dataset (Gerke et 
al. in prep.). Most importantly for group-finding efforts, 
though the mocks match (1) Th e HOD at group ma sses 
(M > 5 X 10^2), as measured in IZheng etHI 1)20071 ) for 
several different luminosity thresholds, /^^j the evolving 
color-densi ty relation that was measured in lCooper et al.l 
(2006) and lCooper e t al.l (|2Q03), and (3) the redshift dis- 
tribution of the DEEP2 data. These three points of 
agreement should be sufficient to ensure that the ob- 
served DEEP2 HOD for group-mass halos is accurately 
reproduced by the mocks. We can thus proceed with 
confidence in using these mocks to optimize our group- 
finding techniques. 

2.2.1. The effects of DEEP2 selection on the observed group 

population 

First, though, it will be interesting to use the mocks 
to investigate the impact of observational effects on the 
galaxy population of massive halos in DEEP2. (We also 
explored this in some detail in G05; see Figures 2 and 3 
of that paper). Figure [H summarizes the impact of the 
various DEEP2 selection effects on galaxies in massive 
dark matter halos in a narrow slice through a mock cat- 
alog, which contains the most massive high-redshift halo 
in the mocks (this region is depicted in projection on 
the sky, before and after selection, as the colored points 
in the upper left and right panels, respectively). There 
are three primary selection effects that remove galaxies 
from the mock sample. In the figure, these selections 
are depicted visually by vertical lines across the main 
panel, and galaxies' paths through the selection pro- 
cess are shown by horizontal lines running from left to 
right, with group- mass halos indicated by gray horizon- 
tal bands. First, the DEEP2 R = 24.1 magnitude limit 
removes faint galaxies, with red galaxies being excluded 
at brighter luminosities than blue ones. DEIMOS target 
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Figure 2. The dilution of a small subregion of a DEEP2 mock catalog by observational selection effects. Galaxies have been selected 
from a small subregion of a DEEP2 mock catalog, roughly 4 arcmin in RA by 30 arcmin in Dec with a redshift depth of 0.05. This 
strip is indicated in projection on the sky by the colored points in the panels at top left (before selection) and after selection at top right 
(afterward). Galaxies on the red sequence (i.e. redder than the red-blue divide given in Willmer et al. (2006)) are indicated in red, and blue 
galaxies are shown in blue. In order to show the impact on cluster selection, this narrow slice in RA, Dec and redshift was chosen to contain 
the most massive high-redshift cluster in the mock catalogs (a 4.7 X 10^*Mq object at z = 0.8). The main panel is a schematic diagram 
of these galaxies' path through the DEEP2 selection process. At left, we begin with horizontal lines (arranged vertically in order of the 
declination coordinate) representing all galaxies in this subregion more luminous than Mg = —17.6. Each vertical grey line represents a 
step in the DEEP2 selection procedure; galaxies are excluded from the sample by the R = 24.1 apparent magnitude limit, by the targeting 
procedure for assigning galaxies to DEIMOS slits, and by failures to obtain good redshifts for some observed galaxies. Horizontal gray 
bands in the main panel indicate the spatial extent of the four most massive halos in this small region (note that the colored lines within 
these gray bands are not necessarily all members of these halos, owing to projection effects). The masses of the halos are indicated, as are 
their richnesses before and after dilution by DEEP2 selection processes. The bottom panels show the aggregate impact of galaxy selection 
effects on the halo and group population. The lower half of each panel shows the mass function of halos containing one or more galaxy 
in each sample (solid lines) and the mass function of groups with two or more members (dashed lines). The group population selected in 
DEEP2 spans a very broad range in mass and represents an incomplete halo sample at all but the very highest masses. The upper half of 
each panel shows the relation between halo mass and measured group velocity dispersion a^'^ for all groups with two or more members. 
The mean relation remains approximately constant, although the scatter increases since there are fewer galaxies sampling the velocity field. 



selection then removes a random subsample of the re- 
maining galaxies, with some preferential rejection occur- 
ring in massive halos. Finally, some galaxies fail to yield 
redshifts, further diluting the sample. The impact of this 
dilution on the population of galaxies in groups can be 
quite strong: the most massive halo shown in the main 
panel loses some 60% of its members. It also introduces 
an added degree of stochasticity into the mass-selection 
of halos. The least- massive halo shown in the figure con- 
tains two observed galaxies, and would be identified as 
a group, while the next most massive halo contains only 
one observed galaxy, so it would be identified as an iso- 
lated galaxy. 

The lower panels in Figure [D show the effect on the 
mass functions of observed galaxies and groups. DEEP2 
selection effects mean that the sample of systems with 
two or more observed galaxies will only be a com- 
plete sample of massive halos at relatively high masses 
> 5 X 10^'^Aff^. However, the cutoff in the mass selec- 



tion function for groups is quite broad, owing to the 
stochastic effects mentioned above, so that even halos 
with M < IO^^Mq have some chance of being identified 
as groups. 

The lower panels also show the effect of DEEP 2 se- 
lection on the relation between halo mass and observed 
group velocity dispersion (for systems with two or more 
galaxies at each stage). As expected, the scatter in this 
relation increases as we move through the selection pro- 
cess, since the number of galaxies sampling the velocity 
field is reduced. However, a clear correlation remains 
between the mass of a halo and the dispersion a^^^ of 
its galaxies' peculiar velocities. It should therefore be 
possible, at least in principle, to use a DEEP2 group 
catalog to measure the halo mass functio n and constrain 
cosmological parameters, as proposed in iNewman et al.l 
([2002) , provided that the halo selection function imposed 
by DEEP2 galaxy selection can be understood in detail. 
In addition, it would be necessary to carefully account for 



the increased scatter in the M-af^"^ relation imposed by 
selection effects. We describe a computational approach 
to achieving this in the Appendix. 

3. CRITERIA FOR GROUP-FINDER OPTIMIZATION 
3.1. Group-finding terminology and success criteria 

The aim of our group-finding exercise is to identify sets 
of galaxies that are gravitationally bound to one another 
in common dark-matter halos. A perfect group catalog 
would identify all sets of galaxies that share common ha- 
los and classify them all as independent groups, with no 
contamination from other galaxies, and no halo members 
missed. Any realistic algorithm for finding groups in a 
galaxy catalog, however, is subject to various sources of 
error that cannot be fully avoided, owing largely to in- 
completeness in the catalog an d ultimately to the nois e 
inherent in any discrete process (jSzapudi k, Szalavll996D . 
Any individual type of error can typically be reduced 
to some extent by varying the parameters of the group- 
finder, but this often comes at the expense of increases 
in other kinds of error. The classic example of this is the 
trade-off between merging neighboring small groups to- 
gether into spuriously large groups on the one hand and 
fragm enting large groups into sm aller subclumps on the 
other fN olthenius fc White! ITQSTI ). 

Because there are inevitably such trade-offs between 
various different group-finding errors, it is important to 
define clearly the criteria by which group-finding success 
is to be judged and the requirements for an acceptable 
group catalog. As discussed by G05, the optimal bal- 
ance between different types of error will depend on the 
particular scientific purpose to be pursued by study of 
the groups. In the present study, our primary goal is 
to produce a group catalog that accurately reconstructs 
the abundance of groups as a function of redshift a nd ve- 
locity dispersion, N{a-, z). As discussed in Newma n et al.l 
([2002) , such a catalog can be used to place constraints on 
cosmological parameters. Therefore, our optimal group 
catalog will be the one that most accurately reconstructs 
A^(i7, z). It is also of interest to use the grou p catalog for 
studi es of galaxy evolution in groups (e.g.. iGerke et al.l 



2007fl or of the evolution of group scaling relations {e.g., 
Jeltema et al.l (2009)); a catalog that can be used for 
those purposes is a secondary goal. These two goals will 
drive our choice of metrics for group-finding success in 
what follows. 

3.1.1. What is a group? 

In tests using mock catalogs, the "true" group catalog 
is known, and we are using our group-finding algorithm 
to produce a "recovered" group catalog; this leads to 
potential ambiguity in the meaning of the word group. 
To distinguish clearly between the two c a.ses, we adopt 
termin ology similar to that employed by iKoester et al.l 
()2007() . For the purposes of discussing group- finding in 
the mocks, a group is defined to be a set of two or more 
galaxies (the group members) that are linked together by 
a group-finding algorithm. Galaxies that are not part of 
any group are called field galaxies. By this definition, a 
group is not necessarily a gravitationally bound system; 
rather it is exactly analogous to a group in the real data. 
By constrast, a halo, for the purposes of discussing group 
finding, is defined to be a set of galaxies in the observed 



mock (the halo members) that are all actually bound 
gravitationally to the same dark-matter halo in the back- 
ground simulation [3- It is possible to have a halo that 
contains only a single galaxy; such galaxies (and their 
host halos) are called isolated and are analogous to field 
galaxies in the group catalog. By comparing the set of 
groups to the set of non-isolated halos in the mock cata- 
log, then, it will be possible to judge the accuracy of the 
group- finder. 

It will also be useful to distinguish between the intrin- 
sic properties of halos (e.g., the total richness, or number 
of halo members above some luminosity threshold) , the 
observable properties of halos (e.g., the observable rich- 
ness, or total number of halo members that are in the 
mock catalog after DEEP2 selection has been applied), 
and the observed properties of groups (e.g., the observed 
richness, or total number of group members). Unless 
otherwise specified, we will always discuss the properties 
of groups and halos as computed using their member 
galaxies: for example, the velocity dispersion of a halo 
will always be the dispersion of the halo members' veloc- 
ities, erf , rather than the dispersion of the dark-matter 
particles, cr^^, unless we explicitly specify that we are 
talking about a dark-matter dispersion. 

3.1.2. Success and failure statistics: basic definitions 

There are two primary modes of group-finding failure, 
for which we will adopt the same terminology used in 
G05. Fragmentation occurs when a group contains a 
proper subset of the members of a given halo, while over- 
merging refers to a case in which a group's members in- 
clude members of more than one halo. A special case of 
overmerging involves isolated galaxies that are spuriously 
included in a group; such galaxies are called interlopers. 
It is also possible for fragmentation and overmerging to 
occur simultaneously, as when a group contains proper 
subsets of several different halos. 

Fragmentation and overmerging are generally likely to 
lead to a wide diversity of errors when a group catalog 
is considered on an object-by-object basis, so it will be 
useful to define a set of statistics that summarize the 
overall quality of the catalog. Here we will adopt the 
statistics used in G05 (with one addition, /noniso), which 
can be summarized as follows. On a galaxy-by-galaxy 
level, we define the galaxy success rate Sg^i to be the 
fraction of non-isolated halo members that are identified 
as group members. Conversely, the interloper fraction 
/int is the fraction of identified group members that are 
actually isolated galaxies. It is also worth considering 
the quality of the field galaxy population, since a perfect 
group finder would leave behind a clean sample of iso- 
lated galaxies. We therefore also compile the non-isolated 
fraction /noniso, which is the fraction of field galaxies that 
are actually non-isolated halo members. On the level 
of groups and halos, we define two different statistics. 
Broadly speaking, the completeness C of a group cata- 
log is the fraction of non-isolated halos that are detected 
as groups, while the purity P is the fraction of groups 

^^ The assignment of mock galaxies to halos of course depends 
on the simulation, halo-finding, and mock-making algorithms we 
employ; we discuss this futher in the paper describing the mocks 
(Gerke et al. in prep.). For the purposes of this study, though, the 
galaxy-halo assignment can be taken as "truth", since the choice 
of algorithms has already been made. 



that correspond to non-isolated halos. In general, the 
classic trade-offs inherent in group-finding are evident in 
these statistics: changes to the group finder that improve 
completeness or galaxy success will typically have nega- 
tive effects on purity and interloper fraction. 

Attentive readers will notice here that we have not yet 
defined what it means for a halo to be "detected" or for 
a group to "correspond" to a halo, so the meanings of 
of the terms completeness and purity are still unclear. 
These definitions, which are somewhat subtle, are the 
subject of the following sections. 

3.1.3. Matching groups and halos 

In order to compute the completeness and purity of 
a group catalog we must first determine a means for 
drawing associations between groups and halos. In 
the case of groups identified in a mock galaxy cat- 
alog, the most natural way to do this is consider 
the overlap between the groups' and halos' members. 
This basic approach has been u sed with good s uccess 
in many previous s tudies (e.g., fEke et 3,1.112004 G05, 
Kocstcr ct al.' ' 20071: iKn obcl ct al.l 120091: ICucciati et al.l 
2010: Soares-Santos et al. 201 liV We associate each 
group to the non-isolated halo that contains a plural- 
ity of its members, if any such halo exists (otherwise the 
cluster is a false detection). Similarly, we associate each 
non-isolated halo to the group that contains a plurality of 
its members (again if any such group exists). In the case 
of ties, e.g., when two halos contribute an equal number 
of galaxies to a group (an example of overmerging) , we 
choose the object that contains the largest total number 
of galaxies, or, if this is still not unique, the one with the 
largest observed velocity dispersiorQ- Hereafter, we will 
use the term Largest Associated Object (LAO) to refer 
to the group (halo) that contains the plurality of a given 
halo's (group's) members. 

This matching procedure is rather lenient and is by no 
means unique: a group can in principle be associated to 
a halo with which it shares only a single galaxy, multiple 
groups can be matched to the same halo (and vice- versa), 
and a cluster may be associated to a halo that is itself 
associated to some other cluster For example, if a halo 
H with five members is divided into two groups, Gi with 
three members and G2 with two, then Gi and G2 are 
both associated to H, but H is only associated to the 
larg er of the two g roups, Gi (see Figure 4 of G05 or Fig. 
3 of lKnobel et all (|2009.) for depictions of other compli- 
cated associations) . This example also illustrates the dif- 
ference between one-way and two-way associations: Gi 
is associated with H, and vice- versa, so this is a two-way 
match; however, G2 is associated with H, but the reverse 
is not true, so this is a one-way match. 

In G05, we used a more stringent matching criterion 
that made an association only when the LAO contained 
more than 50% of the galaxies in a given group or halo. 
This definition has the virtue of removing the need to 
break ties between possible LAOs, but it is somewhat 
problematic in the case of low-richness systems. If, for 
example, a halo containing four galaxies had two of its 
members assigned to the same group by the group- finder, 
with the other two being called field galaxies, the G05 

^^ we would choose randomly if both tie-breaker criteria failed, 
although this does not occur in practice 



criterion would class the group as a successful detection 
but would deem the halo to be undetected. Because of 
situations like this, we choose here to separate questions 
of simple group detection from issues of group-finding 
accuracy. In order to assess the latter, we also compute 
the overall matching fraction f of each group-halo asso- 
ciation: the fraction of galaxies in a given system (group 
or halo) that are contained in its LAO. In what follows, 
we will use this fraction to consider more and less strin- 
gent limits on accuracy when computing completeness 
and purity statistics. 

3.1.4. Purity and completeness 

To compute purity and completeness, it will be neces- 
sary to define the criteria by which a group-halo associ- 
ation constitutes a "good" match, to be counted toward 
these statistics. In general we will count associations 
above some threshold in /, and we will compute sepa- 
rate purity and completeness values for one-way and two 
way matches. We will represent these various purity and 
completeness statistics using the symbols ™P/ and ™G/, 
where we are only counting associations with match frac- 
tions larger than /, and w = 1 ot w ^ 2 indicates that 
we are counting one-way or two-way associations. 

The simplest statistics to use are ^Pq and -^Cq, which 
denote the fraction of groups and halos, respectively, 
that have any associated object whatsoever, regardless 
of match fraction or match reciprocity. These values are 
good for getting an overall sense of the group-finder's 
success at making bare detections of halos, but their use- 
fulness is somewhat limited since, for example, one could 
achieve ^Gq =^ -Po = 1 simply by placing all galaxies into 
a single enormous group (in this case, all halos would be 
associated to the group, and the group would be associ- 
ated to the largest halo). A more useful pair of statistics 
is ^Gq and ^Pq, the fractions of halos and groups that 
have two-way associations, regardless of match fraction. 
These tell us the fraction of halos that were detected 
without being merged with a larger halo and the frac- 
tion of groups that are not lesser subsets of a fragmented 
halo. In the pathological all-inclusive cluster example 
above, ^Pq = 1, but ^Gg is near zero, indicating a prob- 
lem. 

This also illustrates the usefulness of comparing one- 
way and two-way completeness and purity statistics for 
diagnosing problems with a group finder. If ^Gq is sub- 
stantially larger than ^Gq, for example, then a significant 
fraction of detected halos must have been merged into 
larger systems, so overmerging is a significant problem. 
Conversely, if ^Pq is much larger than ^Pq, then there 
must be substantial fragmentation in the recovered cat- 
alog. It will also be interesting to consider completeness 
and purity statistics using different values for /, such as 
■^Gso and ^Pso, which were used in G05. As discussed 
above, however, using more stringent matching-fraction 
thresholds can give an overly pessimistic impression of 
the overall detection success. For our main assessment 
of overall completeness and purity, then, we will use ^Gq 
and ^Pq, since these statistics use the broadest possible 
definition of a "good" match that does not count frag- 
ments and overmergers (beyond the largest object in each 
fragmented or overmerged system) as successes. 

3.1.5. The velocity function of groups 
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In addition to considering the detection efficiency of 
the group finder on a system-by-system basis, for some 
science applications one may also be interested in vari- 
ous properties of the group catalog as a whole. In the 
case of DEEP2, it has been shown ([Newman et al.|[200^ 
that the bivariate distribution of groups as a function of 
redshift and velocity dispersion, dN{z)/dav, can be used 
to constrain cosmological parameters, since it depends 
on the volume element V{z) and on the evolving group 
velocity function dn(z)/da^, both of which depend on 
cosmology. Marinoni et al. (2002) and G05 have shown 
previously that the VDM groupfinder can accurately re- 
construct this distribution in high-redshift spectroscopic 
surveys. 

In this study, we will use the reconstuction of the veloc- 
ity function as a second measure of group-finding success. 
After we have optimized the completeness and purity of 
our groupfinder, we will further optimize the group-finder 
to reconstruct dN {z) / da^ as well as is possible without 
sacrificing completeness or purity. In practice, this boils 
down to comparing the number counts of groups and ha- 
los in bins of z and a^. Since the distribution is quite 
steep in CTv, it will be important to take some care in 
our choice of binning. We discuss these details below in 
Section [nH] 

4. THE GROUP-FINDING ALGORITHM 
4.1. The Voronoi-Delaunay group finder 

The Voronoi-Delaunay method (VDM) group finder is 
an algorithm for detecting groups of galaxies in redshift 
space from spectroscopic survey data. It has advantages 
over the usual Friends-of-Friends (FoF) approach in very 
sparsely sampled datasets, when the linking lengths re- 
quired for FoF group-finding become larger than typical 
group sizes (for a more detailed discussion of this point, 
see G05). VDM makes use of the local density informa- 
tion that is obtained by computing the three-dimensional 
Voronoi tesselation and Delaunay mesh of the galaxies in 
redshift space. The Voronoi tesselation is a unique par- 
titioning of space about a particular set of points (the 
galaxies in this case), in which each point is assigned 
to the unique polyhedral volume of space (the Voronoi 
cell) that is closer to itself than to any other point. The 
Delaunay mesh is the geometrical dual of the Voronoi 
tesselation and consists of a network of line segments 
that link each point to the points in immediately adja- 
cent Voronoi cells. Galaxies that are directly linked by 
the Delaunay mesh are called first- order Delaunay neigh- 
bors, neighbors of neighbors are second- order Delaunay 
neighbors, and so on. 

The VDM algorit hm was first described by 
IMarinoni et al.l (j2002[ ). who showed that it could 
be used to detect galaxy groups in a DEEP2-like survey. 
In particular, they showed that the VDM algorithm 
could be tuned to accurately reconstruct the distribu- 
tion of groups as a function of velocity dispersion a^ 
and redshift z, above some threshold in a^\ this was 
confirmed by G05, who produced a preliminary DEEP2 
group catalog using a version the VDM algorithm. 
VDM has also been applied successfully to the VVDS 
(ICucciati et al.l [20lol) and zCOSMOS (IKnobel et al.l 
12009) redshift catalogs. Readers are referred to G05 
and iMarinoniet al., (,20021 ) for detailed descriptions of 



the algorithm we will be using in this study. Here, we 
summarize the basic algorithm and the differences from 
the version we used in G05. 

After computing the Voronoi tesselation and Delaunay 
Mesh for a given galaxy sample, the VDM algorithm pro- 
ceeds in three phases. In Phase I, the galaxies are first 
sorted in increasing order of their Voronoi cell volume, 
a time-saving step which ensures that group-finding is 
attempted in very dense regions first. Then, proceeding 
through this sorted list in order, we consider each galaxy 
in turn as a "seed" galaxy for a galaxy group, provided 
that it has not already been assigned to a group. A 
cylindei[3 is drawn around each seed galaxy with radius 
7?,mi„and length 2Cmin- If that cylinder contains any 
first-order Delaunay neighbors of the seed galaxy, they 
are deemed to be part of a group, and the algorithm pro- 
ceeds to Phase II. If no first-order neighbors are found 
in the Phase I cylinder, no group is detected, and the 
algorithm proceeds to the next galaxy in the list. 

In Phase II, a larger cylinder is defined around the 
seed galaxy, with radius TZu and length 2£ii. We count 
the number of galaxies in this cylinder that are first or 
second-order Delaunay neighbors of the seed galaxy, de- 
noting this number by iVii. Since the number density of 
observed galaxies varies with redshift, we correct Nu by 
the ratio of number density of DEEP2 galaxies at z = 0.8 
to the local number density at the group redshift. The 
number density is computed by smoothing the DEEP2 
redshift distribution and dividing by the comoving cos- 
mological volume element. 

The corrected value, Nff", is taken as an initial es- 
timate of the size of the group and is used to scale the 
final search cylinder in Phase III. The Phase III cylinder 
is centered on the barycenter of the Phase II galaxies and 
has radius TZm = max(r x {Nf{'"y/'^,TZmin) and length 
Mil = max(£ X {Nf{'")^^^,£rnin), with r and £ being the 
Phase III parameters of the algorithm. All galaxies that 
fall within the Phase III cylinder are deemed to be mem- 
bers of the group. The algorithm then continues to the 
next galaxy in the list that has not yet been assigned to 
a group and repeats the procedure. 

The VDM thus has six tuneable parameters (two for 
the search cylinder in each of the three phases) that must 
be optimized for a particular survey. These are not fully 
independent, however. For example, an increase in the 
size of the Phase II cylinder will increase the typical A'n 
values and so can be offset by a decrease in the Phase III 
r and £ parameters. Furthermore, our group-finding ex- 
ercise (indeed, any group-finding exercise) can be concep- 
tually subdivided into two steps: group detection, which 
occurs in Phase I alone, and membership assignment, 
which occurs in Phases II and III. The parameters that 
control each of those steps can be tuned more or less in- 
dependently of one another on the way to determining 
an optimum set of group-finding parameters. 



^* All VDM cylinder dimensions are comoving distances and are 
converted to angular and redshift separations by assuming a flat 
ACDM cosmology with Qm = 0.3. This cosmology is assumed re- 
gardless of the true background cosmology when running on mock 
catalogs, since it is what we assume when running on the DEEP2 
dataset, to allow consistency with previous DEEP2 studies, partic- 
ularly G05. It is straightforward to rescale the cylinder dimensions 
to different assumed background cosmologies. 
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4.2. Changes to the G05 Algorithm 

Before we leave discussion of the VDM algorithm, it is 
important to make note of a few minor changes that we 
have made to the VDM algorithm we used in G05. First, 
we have used a redshift of 0.8 as a reference for correcting 
A^ii, since z = 0.8 is near the peak of the DEEP2 redshift 
distribution, in contrast to the G05 reference value, z = 
0.7, where the redshift distribution is rising sharply in 
the main DEEP2 sample. 

We also made some important changes to the 
membership-assignment part of the algorithm. In G05 
each group included all galaxies identified in either Phase 
II or Phase III of the VDM algorithm, regardless of 
whether or not the Phase III cylinder was larger than 
the Phase II cylinder. This meant that the Phase II 
cylinder dimensions had to be kept relatively small, so 
as not to swamp small groups with interloper field galax- 
ies. In testing the VDM on our new mock catalogs, we 
found that this led to significant fragmentation of larger 
groups: the Phase II cylinder was too small to accurately 
estimate their richnesses, so the Phase III cylinder was 
significantly too small to include all their members. 

To some degree, this is unavoidable in a sparsely sam- 
pled survey, but we found that we were able to mitigate 
it by allowing the Phase II cylinder to be quite large, sim- 
ilar in scale to a massive cluster. To gain this advantage 
while avoiding problems in smaller groups, we decided 
not to include Phase II galaxies in the final group mem- 
berships. That is, we use the Phase II cylinder to get 
a rough estimate of the number of galaxies in the group 
by drawing a cylinder that is typically too large and will 
pick up all the group members and possibly some field 
galaxies. The scaled Phase III cylinder then refines this 
estimate and will frequently select only a subset of the 
Phase II galaxies for the final group. In practice, with a 
very large Phase II cylinder, the A'n counts often simply 
include all second-order Delaunay neighbors, with the 
cylinder simply setting a maximum distance at which 
such neighbors will be considered. For this reason, we 
find that varying the Phase II cylinder at relatively large 
sizes has negligible impact on our results. We thus focus 
mainly on optimizing the Phase I and III parameters in 
what follows. 

4.3. Considerations for the ECS 

Because the galaxies targeted in the EGS cover a very 
broad redshift range with a fixed apparent magnitude 
limit, the range of galaxy luminosities being probed 
varies dramatically from low to high redshift, with only 
very bright {L > L*) galaxies being observed at z > 1 
but extremely faint dwarfs included at low redshift. The 
presence of these introduces some complications into the 
group-finding process. The first has to do with the sim- 
ple definition of a "group." In the main DEEP 2 sam- 
ple, groups are systems containing on the order of a few 
Milky- Way-sized galaxies at least. At low z in the EGS, 
by contrast, we will also be capable of detecting systems 
consisting of a single Milky- Way-sized galaxy and a few 
dwarfs similar to the Magellanic Clouds. Arguably we 
should not categorize the latter systems as groups at all. 

More importantly, the faint, low-z dwarfs present a 
challenge for optimizing the VDM group-finder. Phases 
I and II of the VDM algorithm search for galaxies that 



are connected to a given seed galaxy by one or two links 
in the Delaunay mesh. Using Delaunay connectedness 
in this way as a means of detecting groups of bright 
galaxies rests on the assumption that group members of 
similar luminosity are likely to be Delaunay neighbors. 
When the much galaxies are included, this assumption 
may break down, since dwarfs are much more numer- 
ous than galaxies near L*, and so it is possible that a 
bright galaxy's local Delaunay mesh may be "saturated" 
by dwarfs, cutting off any links to neighboring bright 
objects and preventing detection of the larger group. In- 
deed, in our initial experiments with mock EGS cata- 
logs, we found that it was impossible to achieve satis- 
factory performance with the VDM group-finder at both 
low and high redshift simultaneously if the entire EGS 
galaxy sample was used. 

If we choose to focus our group-finding efforts on sys- 
tems containing multiple bright galaxies, as in the main 
DEEP 2 sample, then fortunately there is a simple way of 
addressing both of the above issues by limiting Phases I 
and 11 of the group finding to bright galaxies only. In par- 
ticular, when computing the Voronoi partition and De- 
launay mesh in the EGS, we can restrict the low-redshift 
(z < 0.8) sample to only those galaxies that are luminous 
enough that they could have been observe d at z > 0.8. 
To do this, we follow the procedures used in lGerke et al.l 
(|2007f ). who defined a set of diagonal cut s in the DEEP2 
rest-frame (i.e., fc-corrected as in Will mer et al.l [2C)06( ) 
color-magnitude space, which correspond to the DEEP2 
R = 24.1 apparent magnitude limit at different redshifts 
(see Figure 2 of that paper). If we define such a cut that 
traces the faint-end limit of DEEP2 galaxies in color- 
magnitude space at z = 0.8, we can then select only 
galaxies brighter than this limit at lower z; these are the 
low-redshift analogs of the main DEEP2 sample. (When 
performing this selection, we also evolve the cut toward 
fainter magnitudes at lower redshifts , according to the 
evolution of L* that was obtained in iFaber et al.l 120071 
namely a linear evolution of 1.2 magnitudes per unit z). 

For EGS groupfinding, we apply this selection to the 
z < 0.8 galaxy population before computing the Voronoi 
and Delaunay information, and we consider only the 
selected galaxies in Phases I and II of the algorithm. 
This means that only systems containing at least two 
bright galaxies (that would be observable at z > 0.8) 
will be counted as groups. In Phase III, however, we con- 
sider all galaxies regardless of luminosity, since this final 
membership- assignment step simply counts all galaxies 
in the Phase 111 cylinder, without reference to the Delau- 
nay mesh. This approach to group-finding in the EGS 
has the virtue of ensuring that the groups in the EGS 
have similar selection, while also counting dwarf mem- 
bers of the groups where they have been observed. 

5. OPTIMIZATION ON DEEP2 MOCK CATALOGS 

Tthe VDM algorithm has six free parameters whose 
optimal values are not immediately obvious. It is thus 
very important to test the algorithm on simulated data 
that reproduce the properties of the real data as accu- 
rately as possible. As discussed above in Section [2?2| the 
mock catalogs developed in Gerke et al. (in prep.) accu- 
rately reproduce a wide array of the observed properties 
of the DEEP2 catalog, including color-dependent selec- 
tion effects that might disproportionately impact galax- 
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ies in groups relative to those in the field. Testing the 
VDM groupfinder on these mocks will thus represent a 
significant improvement over the group-finding effort in 
G05, which made use of mocks that lacked such color- 
dependent effects. The current mocks also have been 
constructed for three different cosmological background 
models, one of which, for the Bolshoi simulation, is very 
close to the model that best fits current data. 

In practice, we optimize the group-finding parameters 
by running the VDM group finder repeatedly on the 
mock DEEP 2 observational fields, allowing the group- 
finding parameters to vary over a wide range in the six- 
dimensional parameter space, and looking for parame- 
ter sets that meet our optimization criteria. Since the 
Bolshoi simulation cosmology is in the best agreement 
with present data, we use this simulation to perform the 
main optimization. However, we repeated this procedure 
on each of the three different sets of mock catalogs de- 
scribed in Section 12.21 (see Table [T]) to test whether and 
to what degree the optimal parameter set depends on the 
background cosmology. 

For our purposes, the optimal set of group-finding pa- 
rameters will be the one that most accurately recon- 
structs the velocity function, as measured using the ve- 
locities of the observed galaxies, while also stiking the 
best possible balance between the purity and complete- 
ness of the group catalog. It is not immediately obvi- 
ous that all of these requirements can be met simul- 
taneously within the six-dimensional VDM parameter 
space. However, the steps in the VDM algorithm di- 
vide rather cleanly into a group-detection step (Phase 
I) and a membership-assignment step (Phases II and 
HI). Since our purity and completeness statistics are 
mostly a test of group-finding success, whereas veloc- 
ity dispersion measurements depend on assigning the 
right galaxies to the right groups, it is reasonable to 
supose that the two success criteria may be optimized 
at least semi-independently. Indeed, experimentation re- 
veals that completeness and purity are only weakly cou- 
pled to the shape of the recovered velocity function, at 
least near the optimum of the purity and completeness 
values: here, purity and completeness depend mostly on 
the Phase I parameters of VDM, while the velocity func- 
tion reconstruction is mainly governed by Phase HI. 

In the following, then, we optimize purity and com- 
pleteness first and then consider the velocity function. 
Additionally, as in G05, we identify a high-purity pa- 
rameter set, for which the purity of the catalog is nearly 
maximized, at the expense of completeness. We will use 
this when constructing the DEEP2 and EGS group cat- 
alogs to identify a subset of groups that should be con- 
sidered higher-confidence detections than the rest. 

5.1. Purity and completeness 

Figure [3] shows the purity and completeness statistics 
^Pq and ^Co that we obtained for widely varying choices 
group-finding parameters in each of the different mock 
cosmologies. Each data point in the Figure represents 
the completeness and purity values (computed over all 
mock lightcones for each cosmology) that we obtained 
for a given set of group-finding parameters and mock 
cosmology. Results are shown for both the main DEEP2 
mocks and the EGS mock catalogs. A diagram like this 
is a very useful visualization tool for group-finder opti- 



mization; it is similar in spirit to Figure 4 of lKnobel et al.l 
(|2009D . The fundamental trade-off between completeness 
and purity is readily apparent in the Figure: an increase 
in completeness is always accompanied by a decrease in 
purity, and vice- versa. 

For each mock cosmology depicted in the Figure, dis- 
tinct clusters of datapoints are apparent. In all cases, 
each individual cluster corresponds to a different value 
of the Phase I parameter TZmin] the impact of vary- 
ing the other five VDM parameters (rather widely in 
many cases) is confined to the area of each cluster of 
points. (The range of 7?.mm values considered in the fig- 
ure is 0.1 <TZmin< 0.5 Mpc.) The obvious conclusion 
is that, for the purposes of optimizing simple group de- 
tection, TZmin^s by far the most important parameter, 
with all other parameters having a comparatively negli- 
gible effect on detection efficiency. In general, increasing 
7?,mi„ improves the completeness statistic while degrading 
the purity (and vice- versa). It is apparent from the Fig- 
ure, however, that improvements in either completeness 
and purity are eventually subject to diminishing returns: 
at sufficiently low values of TZmin, for example, complete- 
ness drops rapidly, while purity remains approximately 
constant. This fact sets a practical range of interest 
for 7?.mi„ (roughly between 0.15 and 0.35 Mpc), beyond 
which changes in that parameter only serve to degrade 
the quality of the catalog. Within this range, the optimal 
choice of TZmin^s deba.table , but a reasonable choice (also 
used bv lKnobel et al . 2009^ is to take the one that gives a 
result in purity-completeness space that is near the min- 
imum distance from the point (1,1) that is obtained over 
the full parameter space. We find that this optimum is 
roughly in the range 0.225 < TZmin ^ 0.3 Mpc (the red 
star denotes TZmin = 0.25 for the main DEEP2 plot and 
0.3 for the EGS plot). Since we will also find below that 
the velocity function reconstruction depends on TZmim 
we will allow this parameter to vary slightly in the next 
step. 

The current analysis already allows us to partially iden- 
tify our high-purity parameter set. Since the purity of 
the catalog effectively saturates at TZmin— 0.15, and since 
purity is not sensitive to any other parameters, we will 
be able to identify a high-purity subset of the final group 
catalogs by setting TZminto 0.15 and holding all other pa- 
rameters fixed at their optimum values, whatever those 
turn out to be. 

Another notable feature of Figure [3] is the significantly 
different purity and completeness values we obtained for 
the different mocks. This implies that our inferred suc- 
cess statistics have a strong dependence on cosmology 
and particularly on the level of clustering in each dark- 
matter simulation. This can plausibly be explained as 
follows. The abundance-matching technique used to con- 
struct the mocks requires, by construction, that the lu- 
minosity function must match the one that is measured 
in the DEEP2 data by placing galaxies in halos and sub- 
halos at fixed number density. As discussed in Gerke et 
al. (in prep.), as the clustering amplitude erg increases, 
so does the number density of halos at fixed mass; hence, 
the abundance-matching algorithm places fainter galax- 
ies in halos (and subhalos) of a given mass. This means 
in particular that massive halos will contain fewer galax- 
ies above a given luminosity — and thus fewer observable 
galaxies — as as increases. This will make these groups 
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Figure 3. Purity and completeness statistics for group catalogs computed with th e VD M group-finder over a wide range of the algorithm's 
parameter space, for each of the three different mock catalogs discussed in Section 12.21 The left panel shows results for mocks selected like 
the main DEEP2 sam ple, a nd the right panel shows results for the EGS mocks. Each point in the plots shows the ^Pq and ^Co statistics 
(as defined in Section 2331 and computed in aggregate over all mock lightcones) for a particular choice of group-finding parameters in a 
particular mock catalog. The mock catalogs are identified by the value of erg assumed for each one in the left panel (see Table[TJ; the relative 
positions of values for each of the three mocks are the same in the right-hand diagram. The well-known trade-off between completeness 
and purity is evident for each mock catalog. There are also substantial differences between the purity and completeness values measured 
in the different mock catalogs. 



more difficult to detect, since their observable galaxy 
populations will be sparser. For example, the typical 
halo with two observed members will be more massive 
in a more clustered cosmology, so its observed members 
will typically be more widely separated in redshift space. 
At the same time, increasing as enhances the clustering 
of the isolated background galaxies, making them more 
likely to be erroneously grouped together as false detec- 
tions. The net effect is that, when the luminosity func- 
tion is held fixed, increasing the galaxy clustering and 
halo mass function (e.g., by raising as and J^m) causes a 
decrease in both completeness and purity. 

This result has important implications for the opti- 
mization of group finders generally. Since mock catalogs 
are usually constructed to reproduce the observed galaxy 
luminosity function reasonably well, this effect is likely 
to be generically present in mocks with different back- 
ground cosmologies and not just in catalogs produced us- 
ing abundance matching. When assessing group-finders, 
then, it will be very important to construct mocks using 
simulations whose background cosmology are consistent 
with the current best-fit cosmological parameters. 

Mock catalogs based on semi-analytic galaxy for- 
mation mo dels applied to the Millennium Simulation 
(jSpringel e t al. 2005|) have the dual advantages of match- 
ing a wide variety of observed properties of the galaxy 
population and of being easy to obtain and use, so 
they have been widely used to test and optimize high - 
redshift group-finders. For e xample, iKnobel et al.l(l2009D 
use d the mock catalogs frq m lKitzbichler &: White! (|2007D . 
and 'Cucci ati et al.l (|2010') constructed mocks using the 
semi-analytic models of De Lucia & Blaiz ot (2007) and 
the li ghtcone-construction techniques of IBlaizot et al.l 
(|2005f ). However, the Millennium Simulation had a back- 
ground cosmology with erg = 0.9, well above the cur- 



rently preferred value of ^ 0.8. Our tests here on 
mocks with different cosmologies suggest that group- 
finders that were calibrated on the Millennium mocks 
may have significant inaccuracies in their estimated pu- 
rity and completeness statistics. (A similar statement 
can be made about our earlier group-finding efforts in 
G05, for which this paper should be considered a replace- 
ment.) More generally, our results suggest that group 
catalogs that were calibrated on mocks with disfavored 
cosmologies should be treated with caution. 

When summarizing the success and failure statistics 
of our DEEP2 group catalog, then, we will use values 
computed using the Bolshoi mock catalogs only. For the 
remainder of this Section, we will also focus our group- 
finder optimization efforts on those mocks. 

5.2. The group velocity function 

While maintaining a reasonable balance between com- 
pleteness and purity of the catalog, we also wish, by tun- 
ing the VDM parameters, to produce a catalog of groups 
that accurately reconstructs the distribution function 
of observed velocity dispersions for halos in the mock, 
dN/dav, at all redshifts of interest. More specifically, we 
will focus on the high-cTv end of the velocity function, 
since this region of the distribution is exponentially sen- 
sitive to changes in cosmology, and we ultimately wish to 
use our group catalog for cosmological tests. In G05, we 
found that VDM could reconstruct the velocity function 
of the lYan et al.l ()2004[ ) mocks accurately at a^ > 350km 
s~^; here we will also endeavor to reconstruct the veloc- 
ity function above a threshold value of a^ that is as low 
as possible. 

Throughout this study, when we discuss the velocity 
function of groups or halos, we are talking about the 
measured dispersion a^ of the member galaxies' line- 
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Figure 4. Fractional error in the recovered group velocity func- 
tion for our optimal set of DEEP2 group- finding parameters (Ta- 
ble [2]l. Data points show the median fractional difference between 
the group and halo number counts (defined precisely in the text) 
in bins of a^ and z, for the 40 Bolshoi mock lightcones. Error 
bars show the standard error computed over the 40 mocks. The 
shaded regions indicate the fractional scatter in the halo num- 
ber counts (due to both sample variance and Poisson noise). For 
(Tv > 300km s~^, the fractional errors are consistent with zero 
and/or are smaller than the sample variance for all redshifts. At 
lower velocity dispersions, the group abundance is systematically 
overestimated. 

of-sight peculiar velocities (though we will generally drop 
the superscript for brevity) . Since the overwhelming ma- 
jority of DEEP 2 groups will contain only a few galaxies, 
we make use of the so-called gapper algorithm to measure 
a^. The gapper measures velocity dispersion using the 
gaps between the measured line-of-sight velocities vi in 
a given group, after the vi have been sorted in ascending 
order: 

This has been shown to be the most robust of seve ral pos- 
sible estimators in the limit of small samples (Beer s et al.l 
[1990). 

In practice, testing the velocity-function reconstruc- 
tion amounts to comparing the number counts of halos 
and reconstructed groups in the mock catalogs, in bins 
of a^ and z. In a perfect group-finder, these histograms 
would be exactly equal. To assess the successfulness of 
the VDM algorithm, we will use a similar approach to 
the one we used in G05: we compare the fractional er- 
ror in the recovered histogram to the fractional size of 
the field-to-field dispersion in the mock histograms. The 
field-to-field dispersion serves as a proxy for the sample 
variance (sometimes called cosmic variance) in the group 



number counts. As a rough rule of thumb, where the er- 
ror in reconstructing the velocity function is smaller than 
the sample variance (and also not systematically high or 
low over a wide range in <j^\ we take the reconstruction 
to be acceptable, since the measurement error is sub- 
dominant to the irreducible uncertainty in the measured 
velocity function that arises from sampling a finite vol- 
ume of space. We will endeavor to reduce this error as 
much as possible, however, focusing mainly on the high- 
dispersion end of the velocity function, since it is the 
most sensitive probe of cosmology. 

It will be important to be careful in our choice of bin- 
ning when performing this test. Because the velocity 
function of groups is quite steep at the high-Uv end, and 
because the volume of DEEP2 is relatively small, there 
will be very few groups at large values of (Tv, especially 
at high redshift. If we choose a binning in a^ that is 
too narrow, then most high a^ bins will contain zero 
or one group, and a slight inaccuracy in the measured 
velocity dispersion of a given group could lead to very 
large apparent fractional errors, when in fact the group 
detection introduces only a minor inaccuracy in the ve- 
locity function — one that would have a negligible effect 
on the inferred cosmology. We would therefore like to use 
a coarse enough binning to ensure that every bin is ex- 
pected to contain at least a handful of groups. Since 
changes in cosmology affect the shape of the velocity 
function as well as the normalization, though, we would 
also like to choose a fine enough binning to capture at 
least some of the shape information. 

Somewhat fortuitously, this set of requirements is iden- 
tical to the one we would use to select a binning for 
performing cosmological tests with the velocity function: 
we wish to measure the shape of the velocity function 
as well as possible given our dataset, but standard tech- 
niques for constructing likelihood functions over the cos- 
mological parameters assume that the dataset contains 
at least a few objects per bin ()Hu fc Cohnll2006t ). Hence, 
for the purposes of testing the VDM velocity-function 
reconstruction, we will choose a binning that would be 
appropriate for using the resulting group catalog in cos- 
mological tests, with at least a few groups falling in 
each bin. Experimenting with the halo population in 
the mock catalogs, we find that a reasonable choice is 
to construct even bins in logj^p "'vj of width 0.1, with the 
addition of a single, broad bin covering all values above 
logj^Q (Tv — 2.75, with redshift bins of constant width 0.2. 
Because G05 found that VDM can accurately reconstruct 
the velocity function above a threshold of (Tv = 350km 
s~^, and lacking any other compelling reason to specify 
a particular positioning for our bin edges, we choose to 
arrange our logarithmic bins such that one of the edges 
falls near the G05 threshold, at logj^p '^v = 2.55. 

Within these bins, we compute the fractional difference 
between the counts of halos and groups and compare it 
to the fractional sample variance in these counts, as de- 
scribed above. By performing this procedure for group 
catalogs computed over a wide range in the VDM pa- 
rameter space, we can search for an optimum set of pa- 
rameters that minimizes the error in the reconstructed 
velocity function at high av Figure [3] shows the recon- 
struction error obtained for this optimum set in the Bol- 
shoi mock catalogs. The black points denote the median 
fractional difference between the binned number counts 
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Table 2 

VDM group-finding parameters used for the 
diflFcrent sample in tfiis study 



Parameter ' 


'■ Main 


EGS 


Higfi-Purity 


''^min 


0.25 


0.3 


0.15 


^min 


10.0 


. . . b 




7^II 


0.8 






Cu 


8.0 






r 


0.225 






l 


10.5 







^ All values are given in comoving /i^^ 
Mpc, assuming a flat ACDM cosmology 
with JIm ~ 0.3. 

^ Values not listed are the same as used 
for the main DEEP2 sample. 

of groups and halos, {Ngroups - Nhaios)/Nhaios, and the 
error bars show the standard error in this quantity, with 
the mediarr^ and standard error computed over all forty 
Bolshoi mock lightcones. The shaded regions show the 
fractional sample variance in each bin, as well as indicat- 
ing the extent of each bin in a^ . 

To arrive at the parameter set used in the figure, we 
varied the VDM parameters over a wide range in pa- 
rameter space (though constraining TZmin^o the narrower 
range of 0.225-0.3 identified in the previous section). We 
used the measured fractional reconstruction errors and 
their uncertainties (i.e., the data points and error bars in 
the Figure) to construct a statistic similar to x^ for the 
bins with CTv > 350 km s~^. We then gave detailed con- 
sideration to parameter sets near the minimum in this 
statistic, tuning the parameters by hand to find the pa- 
rameter set that gives an error in the high-dv velocity 
function that is smaller than the sample variance in all 
bins and is not systematically biased to high or low val- 
ues. These optimum parameters are listed in Table [H 
This set of parameters is also indicated by the red aster- 
isk in Figure |3l It is notable in Figure |4] that this param- 
eter set gives a fractional reconstruction error near zero 
above a threshold of a^ « 300 km s~^, a slight improve- 
ment over the value of 350 km s~^we achieved in G05. At 
lower dispersions, the velocity function is overestimated 
by the group-finder at all redshifts and this bias is larger 
than the sample variance at low z. 

While performing the optimization, we found that the 
reconstructed velocity function was sensitive to the Phase 
I and Phase III VDM parameters only; the Phase II pa- 
rameters had no noticeable impact over the range we 
considered. As discussed in Section 14. 2[ this is because 
we have allowed the Phase II cylinder to take very large 
values, such that we are simply counting all second-order 
Delaunay neighbors in Phase II in nearly all cases. This 
suggests that the Phase II cylinder serves little practical 
purpose in our implementation of the VDM group-finder, 
where we have allowed it to be very large. However, 
keeping it in place has no negative impact on the recov- 

^^ We consider the median fractional error in this diagram, 
rather than the mean, because the former statistic is more resistant 
to the outlier values we occasionally encounter. Sample variance 
and shot noise mean that a few of the Bolshoi mock liglitcones con- 
tain few (in one case zero) halos in one or more of our bins, despite 
the fact that the typical (i.e., median) lightcone contains at least 
five halos in each bin. However, the group-finder still tends to find 
at a few groups in all bins, which leads to a very large fractional 
error for a few lightcones and strongly biases the mean. 



ered catalog, and it has the virtue of ensuring that we 
never count very distant neighbors in Phase II. On the 
other hand, if we allowed the Phase II cylinder to take 
on smaller values the resulting decreases in A/ji could be 
offset by increases in the Phase III parameters, so that 
our parameter space for optimization would still be ef- 
fectively reduced to four parameters, rather than six. 

5.3. The Extended Groth Strip 

Because DEEP2 spectroscopic targets in the EGS were 
selected differently from those in the main DEEP 2 sam- 
ple, it is reasonable to expect that the optimal parameter 
set for group finding will also be different. We therefore 
repeat our parameter optimizaton procedure separately 
on the mock lightcones constructed with EGS selection. 
There are two primary differences between the EGS sam- 
ple and the main DEEP2 sample. First is the presence 
of a low-z sample in the EGS, which includes very faint 
dwarf galaxies. As discussed in Section 14.31 we handle 
this by excluding from Phase I and II galaxies fainter 
than those included in the main DEEP2 sample. The 
second difference is the somewhat denser sampling in the 
EGS permitted by the use of a four-pass slitmask-tiling 
algorithm instead of the two-pass approach used in the 
rest of DEEP2. 

This higher sampling rate would naively be expected to 
make groups easier to detect, and so one might initially 
suppose that completeness, for example, will rise relative 
to the rest of DEEP2, at fixed values of the Phase I 
"T^minParameter. Recall, however, that the Phase I step 
of VDM looks for first-order Delaunay neighbors, and 
increasing the density of objects in the galaxy catalog will 
also shorten the typical linking length in the Delaunay 
mesh. Since the Phase I cylinder has a long, narrow 
shape along the line of sight, it is possible that the denser 
sampling will actually tend to reduce the number of first- 
order neighbors within the cylinder in favor of nearer 
neighbors that do not fall along the line of sight. In this 
case, a slightly larger value for 7?,mmniight be indicated. 

In fact, we find the latter situation to be the case: gen- 
erally speaking, we need a larger value of TZminio strike 
a similar balance between between purity and complete- 
ness to the one we achieved in the main DEEP2 sam- 
ple. In Figure [21 the red asterisks in both panels sit at 
approximately the midway point between the maximum 
attainable purity and completeness values. However, in 
the EGS (right-hand panel) this point corresponds to 
^mjn=0.3, somewhat larger than the value of 0.25 we 
used for the main DEEP2 sample. We will therefore use 
Ti-min=0-'^ when finding groups in the EGS. 

One might also initially imagine that the Phase HI 
parameters should be reduced (for fixed Phase II param- 
eters) to account for an expected increase in A/ii values 
owing to the higher sampling density. However, since 
Mi counts only second-order Delaunay neighbors, it is 
far from clear that its value will be strongly coupled to 
the sampling density. Indeed, we find that, by increasing 
Ti-min^o 0.3 and keeping all other parameters fixed, we can 
obtain an acceptable reconstruction of the velocity func- 
tion at high redshifts, z > 0.7. At lower redshifts, the re- 
constructed velocity function overestimates the true one 
slightly; however, the volume probed here is quite small, 
and we do not plan to use these groups as a cosmological 
probes. In the interest of constructing a group catalog 
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that is as similar as possible to the main DEEP2 sample, 
then, we opt to hold all VDM parameters constant in the 
EGS with the exception of TZmm- 

6. THE DEEP2 VDM GROUP CATALOG 

To construct a catalog of DEEP2 galaxy groups, we 
take all galaxies that yielded good redshifts (DEEP2 
quality flag 3 or 4). We then exclude galaxies at z > 
1.5, which are mainly high-redshift QSOs and Mgll ab- 
sorbers. For the main (non-EGS) DEEP2 sample, we also 
exclude galaxies with z < 0.6. In the EGS, we exclude 
galaxies that are too faint for observation at 2 > 0.8 in 
Phases I and II of the VDM procedure, as discussed in 
section 15.31 This yields a sample of 22,144 galaxies in 
the three high-redshift DEEP2 fields and 12346 galaxies 
in the EGS. Of these, 3100 high redshift galaxies (14% of 
the sample) and 3070 EGS galaxies (25% of the sample) 
are assigned to groups by the VDM group-finder. The 
group fraction in EGS is naturally higher because of the 
higher sampling density in that field. 

6.1. The catalogs of galaxy groups 

We ran the VDM algorithm on each of these samples 
using the parameters listed in Table[2] This produces two 
catalogs for the main sample — one primary catalog and 
one high-purity catalog — and two more for the EGS. To 
identify the high-purity subset of the main group catalog, 
we match the primary and high-purity catalogs using the 
matching techniques described in section 13.1.31 Groups 
in the main catalog that are the LAOs of the high-purity 
groups are then flagged as members of the high-purity 
catalog. 

The main catalog of VDM-identified DEEP2 galaxy 
groups consists of 1295 groups with z > 0.6 in the main 
DEEP2 sample (outside the EGS) and 1165 groups in 
the redshift range < z < 1.5 in the EGS. Of these, 74% 
of the main DEEP2 group catalog and 68% of the EGS 
group catalog is made up of systems with two observed 
members (i.e., galaxy pairs). The catalog is presented in 
Table 21 in which the groups are organized by observa- 
tional field and sorted by velocity dispersion within each 
field. Groups that were also detected in the high-purity 
catalog are identified in the rightmost column of the ta- 
ble. These groups are substantially more likely than the 
rest to correspond to real dark- matter halos (though they 
are not guaranteed to do so, just as the other groups in 
the catalog are not necessarily bogus). Galaxies belong- 
ing to the groups are listed in Table[5l with each galaxy's 
unique DEEP2 identification number listed alongside the 
ID number of its parent group. These catalogs are also 
available in electronic format from the DEEP2 data re- 
lease webpag4E3; they supersede the earlier DEEP2 group 
catalog presented in G05. 

Figure[5]shows the redshift-space distribution of groups 
in DEEP2 field 3 (at 23 hours R.A.), which is the 
DEEP 2 field whose spectroscopic observations cover the 
largest area on the sky (approximately one square de- 
gree). Black points in the figure show the positions of 
field galaxies, while larger circles show groups, with the 
size of the circle indicating the group richness. Blue cir- 
cles are members of the high-purity sample; the rest of 
the groups are indicated in red. 

^° |http : //deep ■ berkeley . edu/dr4] 



Table 3 

Summary of the group-finding success statistics for tfie 
DEEP2 and EGS group catalogs 



Statistic'' 


Main 


Main hi-purity 


EGS 


EGS hi-purity 


^Co 


0.75 


0.49 


0.79 


0.45 


^Co 


0.72 


0.46 


0.74 


0.41 


^Cso 


0.70 


0.46 


0.68 


0.39 


^Cso 


0.68 


0.44 


0.66 


0.38 


iPo 


0.62 


0.71 


0.67 


0.77 


2Po 


0.61 


0.71 


0.65 


0.76 


iPso 


0.54 


0.62 


0.55 


0.63 


^Pso 


0.54 


0.62 


0.55 


0.63 


Sgal 


0.70 


. . b 


0.66 




fint 


0.46 




0.43 




Jnoniso 


0.04 




0.06 





^ Uncertainties in the purity and completeness values, 
as computed from the binomial distribution, are smaller 
than 1% and are suppressed here for brevity. 
^ Galaxy-level statistics are not reported for the high- 
purity catalogs, since these catalogs simply identify a 
subset of the groups in the main group catalogs. 

Figure [B] shows distributions of various basic group 
properties in the EGS and the main DEEP2 catalogs 
(redshift, richness, and velocity dispersion). To allow 
for a fairer comparison between the EGS and the main 
DEEP2 sample, we have also plotted histograms for the 
subsample of the EGS group catalog at z > 0.7. It 
is interesting to note that the high-redshift EGS sam- 
ple contains roughly half as many groups as the rest of 
DEEP2. This is in spite of the fact that the EGS cov- 
ers only 20% of the area covered by the main DEEP2 
sample. The higher observed group abundance in EGS 
derives partially from the higher sampling rate of galax- 
ies (EGS contains 37% as many z > 0.7 galaxies as the 
rest of DEEP2 combined, despite having only a fifth of 
the area) . This means that a larger fraction of low- mass 
halos will have two observed galaxies in the survey and 
thus qualify as groups. The higher density of groups also 
arises partially from the larger value of TZmin'Vfe used to 
detect groups in the EGS. Our purity and completeness 
values have remained roughly the same as in the rest 
of DEEP 2; since the number of observable halos has in- 
creased, this means that the number of false detections 
must also have increased, which is a result of our larger 
Phase I cylinder. 

6.2. Success and failure statistics of the catalogs 

We have chosen to optimize our group-finding param- 
eters to accurately reconstruct the velocity function of 
groups while striking a balance between the purity and 
completeness statistics ^Pq and ^Co for the catalog as a 
whole. For the catalog we obtain, it will also be useful to 
compile some of the other success and failure statistics 
defined in Section [XTl such as Sgai and /int, both for the 
catalog as a whole and as a function of redshift and group 
properties. Many of the success and failure statistics we 
will discuss in this section are summarized in Table [3] 

Figure [7] shows six different definitions of both purity 
and completeness: ^Pf and ""C/ for one-way and two- 
way matching (w = 1 and 2) and matching fractions 
/ = 0,0.3 and 0.5. Each statistic is shown as computed 
in bins of redshift, richness and velocity dispersion. It 
is notable that, for the catalog as a whole, the differ- 
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Figure 5. The redshift-space distribution of groups and field galaxies in the largest of the three DEEP2 fields {DEEP2 field 3, at 23 
hours RA). Each panel shows a different redshift range, and the three panels taken together represent a contiguous volume covering the 
redshift range 0.75 < z < 1.4. The vertical axis in each panel shows transverse distance along the Right Ascension direction on the sky; 
the plots are projected along the Declination direction. Black points show the positions of field galaxies, while red and blue circles show 
the positions of groups. Blue circles indicate groups that are part of the high-purity subsamplo, and red circles show other groups. The 
size of each circle indicates the richness of the group. The horizontal empty region about two thirds of the way from the bottom of each 
panel is caused by a gap in the DEEP2 photometry. 
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Figure 6. Histograms of various properties of the DEEP2 group 
catalog. Solid lines show properties of groups in the main sample 
(excluding the EGS), and dashed lines show groups in the EGS. 
Dot-dashed lines show EGS groups at z > 0.75 (for better compar- 
ison to the main DEEP2 sample). 

ent measures of purity and completeness are similar and 
are all quite flat with redshift. In particular, since the 
one-way and two-way statistics are quite similar, we can 
conclude that the fragmentation and overmerging rates 
are low for the catalog as a whole. However, when we 
look at the purity and completeness statistics as a func- 
tion of richness and velocity dispersion, it is clear that 
group-finding errors become more prevalent for richer 
and higher CTv groups. 

For example, if we consider a stringent 50% match- 
ing fraction, both the completeness and purity of groups 
with N > 5 are quite low, even though our usual ^Pq and 
^Co statistics are quite high. We also note that the P50 
and C50 curves show an odd-even "sawtooth" pattern in 
the richness direction. This is a manifestation of of the 
weaknesses of these statistics that we discussed in Sec- 
tion 13.11 a halo with A^ = 5 needs three detected mem- 
bers to be classified as a successful detection by the ^Pso 
standard, but a A^ = 6 halo needs four detected mem- 
bers. These plots suggest that, although a large fraction 
of high-richness halos are successfully detected, many are 
fragmented and, conversely, although almost none of the 
high-richness groups are false detections, most of them 
are either overmergers, or have a large interloper fraction, 
or both. 

This situation is unfortunate, but it is also difficult to 
improve upon. If we make a diagram similar to Figure |3] 
showing ^Pso versus ^Cso for groups with iV > 4 only, 
for example, we can see a similar trade-off between these 



two statistics, such that increases in one statistic are gen- 
erally offset by decreases in the other. We have therefore 
chosen not to further tune our parameters to improve 
the high-richness success parameters. Instead, we cau- 
tion that although high-richness DEEP2 groups can be 
treated as very high-confidence detections, their mem- 
berships may be problematic. It may therefore be advis- 
able, when considering individual high-richness groups, 
to reconsider their memberships in a non-automated way, 
e.g., as was done when considering X-ray detected EGS 
groups in Jeltema et al. (2009). 

The success as a function of velocity dispersion is po- 
tentially more troubling, especially as regards complete- 
ness. The purity is relatively flat as a function of Cv , with 
a small drop around 400 km s~^, which likely occurs be- 
cause groups with a larger Phase III cylinder are likely 
to be have a higher contamination by interlopers. A 
stronger and more sustained drop in the completeness o 
especially the two-way completeness, starts at around 
200 km s~^. At higher values of a^, the one-way and 
two-way completeness values are quite different, indicat- 
ing that the pieces of the high-dispersion halos that VDM 
detects are being merged with other (presumably lower- 
dispersion) systems. The drop in completeness can be ex- 
plained by noting that the halo population is dominated 
by systems with N = 2 (i.e., pairs) at all dispersions, 
and for these systems, a^ is simply proportional to the 
velocity difference between the members, a^ = Y^Ati/2 
(see Equation [Ij . Detection in Phase I would require 
these two galaxies to be first-order Delaunay neighbors, 
but as they become more widely separated in redshift 
space, they become increasingly less likely to be linked. 
Furthermore, the Phase I cylinder half-length has an ef- 
fective length of ^ 850 km s~"'^in velocity at z ~ 1, so 
pairs with CTv ^ 750 km s~^are completely undetectable 
in the absence of interlopers^. 

On the face of it, the rapid drop in completeness with 
a^ raises concerns about using the VDM catalog to mea- 
sure the abundance of high-dispersion groups as a cos- 
mological test. The situation may not be as bad as it 
initially appears, however. As we just explained, the ha- 
los that are likeliest to go undetected are systems with 
two observed galaxies. These will typically be halos at 
low mass whose measured CTv values have been artificially 
boosted by small-number statistical effects. Indeed, as 
shown in the lower right panel of Figure [21 the population 
of halos above any threshold in a^ includes a very broad 
range of masses, including many with M < 10^'^ Mq, ow- 
ing to the very large scatter in velocity dispersion mea- 
sured using the observed galaxies in the halos. Such low- 
mass systems that have been scattered to high a^ are 
contaminants in the high-mass sample that one would 
like to construct for use in cosmological tests. 

In the Appendix, we develop a theoretical approach to 

^^ It is important to note that this result is quite different from 
the one shown in Figure 8 of G05, which showed that ^Cso was flat 
with (Tv. As we were testing VDM for this study, we found that that 
figure suffered from a software error which erroneously calculated 
a flat completeness value. In fact a similar dropoff in completeness 
was present in the G05 catalog. Since the current study constitutes 
a replacement for G05, this error should not impact any future 
studies. 

^■^ An example of such a system is a halo with o-Jp'^ = 500km 
s~^and two observed members with typical velocities that are ori- 
ented in opposite directions along the line of sight. 
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Figure 7. Various purity and completeness statistics for the main DEEP2 group sampl e, computed as a function of redshift, richness, and 
velocity dispersion. Datapoints show the statistics ""P/ and ™Cj (defined Section l3.f .41 1 for match reciprocity w = 1,2 and match fraction 
/ = 0, 0.3, 0.5, computed over all forty mock lightcones from the Bolshoi simulation. Error bars on these values are computed using the 
binomial distribution. One-way (ui = f ) statistics are shown as dotted lines and two-way statistics as solid lines. 

group catalog very carefully. Such a detailed analysis of 
the suitability of our group catalog for cosmological tests 
is beyond the scope of this study though, and apart from 
a brief discussion in the Appendix, we defer it to future 
work. 

In the case of the EGS, we find similar (though nois- 
ier) trends of purity and completeness with N and a^ ■ As 
shown in Figure [51 however, the purity and completeness 
statistics are roughly constant at high redshift, but they 
degrade somewhat at the redshifts below those probed 
in the rest of DEEP2. The likely reason for this has to 
do with the effective sampling rate of bright group mem- 
bers in the low-redshift EGS. As we discussed in Sec- 
tion 14. 3( when we perform VDM Phases I and II in this 
regime, we consider only galaxies bright enough that they 
could have been observed at z > 0.8. The EGS target- 
selection downweights low-redshift galaxies so that the 
redshift distribution of targets is roughly constant, and 
since the low-redshift pool of targets also includes faint 
galaxies that might be observed in lieu of bright ones, 
the sampling rate of bright galaxies is sharply reduced 
at low redshift relative to what can be attained at high 
redshift. A more sparsely sampled galaxy population is 
certain to result in a lower-quality group catalog, since 
there are fewer observable halos overall, and their mem- 
bers are more widely separated. Thus, although we have 
attempted to select groups as uniformly as possible, there 
are limits to how well this can be accomplished. For 
this reason, we advise particular caution when compar- 
ing the low-redshift group catalog in the EGS to the rest 
of DEEP2. 

In addition to identifying individual groups, the group 
catalog also partitions the galaxy catalog into a set 
of group members and a set of field galaxies. Since 
galaxy groups and clusters are an interesting environ- 
ment for studying galaxy formation and evolution, it 
will be worthwhile to consider the success and failure 
statistics for the galaxy population that we defined in 
Section [3TTJ These statistics, computed for the catalogs 
as a whole, are summarized in Table [31 As shown in 
Figure [21 these values are nearly constant with redshift 
at high z; however, there is a significant worsening of 
the galaxy success statistics at low redshift in the EGS. 
The increase in interloper fraction at low redshift is likely 



Figure 8. Similar to the left-hand panel of Figure[7l but for the 
EGS mock catalogs. 

predicting the velocity function of an observed group cat- 
alo g as a function o f cosmo logical parameters, expanding 
the I Newman et all (2002) analysis to include observa- 
tional selection effects and the impacts of contamination 
from low-mass systems with high measured dispersion 
values. Although it is essential to include these effects in 
the theoretical predictions, it is not necessary that the 
actual contamination in the group catalog arise from the 
exactly the same low-mass halos that have large true ve- 
locity dispersions, as long as the contamination in the 
group catalog is at the same level as expected in the halo 
population. Since our group catalog accurately repro- 
duces the halo velocity function at high a^ (as shown 
in Figure [4]) , it appears that the contamination levels at 
high dispersion in the halo and group popluations are 
similar. That is, the impurities in the catalog (which 
are also mostly pairs) account for the undetected pairs 
at high dispersion. It may thus still be reasonable to 
use this group c atalog for a test like the one proposed in 
[Newman et al.l (|2002 ). although it will be important to 
investigate the incompleteness and contamination of the 
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caused by the presence of faint isolated galaxies that hap- 
pen by chance to fall in the Phase III cylinder of some 
group (the Phase III cylinder can be many tens of Mpc 
long in comoving line-of-sight distance to account for the 
finger-of-god effect.) The reduction in S'gai likely derives 
from the lower sampling rate in the low-z EGS that we 
discussed above. 

Finally, we note that within the halo model for galaxy 
formation, it is natural to divide the galaxy population 
into central galaxies, which are located at the centers of 
distinct dark-matter halos, and satellite galaxies, which 
are located in subhalos. Considering the properties and 
evolution of centrals and satellites separately has recently 
been a popular and fruitful approach to galaxy evolu- 
tion studes. Ideally, one would hope that the population 
of field galaxies identified by a group-finder would con- 
stitute a relatively pure sample of central galaxies that 
could be studied separately from the satellite popula- 
tion. In tests on the DEEP2 mock catalogs, we find that 
the fraction of field galaxies that are actually satellites is 
11%, which is a mild improvement over the 16% satellite 
fraction in the overall population of galaxies in the mock. 
(These numbers are 12% and 18%, respectively, in the 
EGS mocks.) The satellite fraction can be higher than 
/noniso sincc somc truly isolated galaxies in DEEP 2 will 
be satellites if their central galaxies were not scheduled 
for DEIMOS spectroscopy or did not yield good redshifts 
(this also means that even a perfect groupfinder would 
not yield a satellite fraction of zero for the field galaxies). 

6.3. Comparison to other high-redshift spectroscopic 
group catalogs 

There are two other spectroscopic galaxy surveys in the 
literature that bear a particular resemblance to DEEP2 
in targeting galaxies at redshift of order unity with sam- 
ple sizes on the order of tens of thousands of galax- 
ies. These are the V IMOS VLT Dee p Survey (VVPS : 
ILeFevreet all [20051) and zCOSMOS (ILillv et al.l 120091) 
samples. Group catalogs have been computed for 
both of these s ample s using the VDM gr oup-finder by 
iCucciati et aD (|2010h (herefter CIO) and iKnobel et atl 
()2009f ) thereafter K09), respectively. (The latter authors 
also computed a group catalog using a friends-of-friends 
algorithm and used information from both algorithms to 
construct a final group catalog for zCOSMOS.) Since the 
samples and methods used in those studies are broadly 
similar to ours here, it will be useful to compare the 
DEEP2 group catalog with the VVDS and zCOSMOS 
results. 

The DEEP2 group catalog presented here is larger 
than the VVDS and zCOSMOS catalogs, and it probes 
higher typical redshifts. The DEEP2 catalog contains 
3016 groups in total, compared with 318 in VVDS and 
1473 in zCOSMOS. The latter two catalogs lie primar- 
ily in the range 0.2 < z < 1.0, with only a few systems 
at higher z, whereas the DEEP2 catalog focuses on the 
range 0.7 < z < 1.4. 

There are important differences among the reported 
success and failure statistics of the various catalogs. As 
we do, CIO find that they can accurately reconstruct the 
velocity function of groups for CTv ^ 350km s~^. K09 do 
not consider the velocity function, but their VDM cat- 
alog significantly overestimates the group abundance at 
all richnesses, while their combined friends-of-friends and 
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Figure 9. G alaxy-level success and failure statistics (defined in 
Section 13.1.21 for the main DEEP2 sample (top) and EGS (bot- 
tom), as a function of redshift. The solid lines show the galaxy 
success rate, 5gai; dashed lines show the interloper fraction, /int, 
and dotted lines show the nonisolated fraction, /noniso- 

VDM catalog accurately reconstructs the abundance. 
CIO and K09 both report purity and completeness val- 
ues using the one-way and two-way P50 and C50 statis- 
tics, so we can compare our catalog with theirs on those 
terms. We will focus on two-way statistics, since these 
are the most stringent test of group-catalog quality and 
since the one-way and two-way statistics for our cata- 
log are quite similar. Table |3] summarizes various purity 
and completeness statistics for the DEEP2 catalog. In 
terms of overall purity and completeness, this catalog, 
with (^i-50,^ C'50) = (0.54,0.68) is intermediate between 
the zCOSMOS (0.72, 0.78) values and the VVDS values 
of (0.49, 0.53). This is not unexpected, since the DEEP2 
sampling rate is intermediate between the other two sur- 
veys, and sampling density is the most important factor 
governing group-finding success in redshift space. 

When we consider the completeness and purity for 
high-richness groups, however, there are some rather 
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striking differences between the DEEP2 catalog and the 
other two. As shown in Fig [3 the DEEP2 P50 and C50 
statistics are rather low for groups and halos with more 
than three members. By contrast, both CIO and K09 
report purity and completeness statistics that are nearly 
flat with richness. As we discussed above, we have not 
been able to find a parameter set that increases the high- 
richness purity or completeness without causing a decline 
in the other paremter. We have run VDM on the DEEP2 
(Bolshoi) mock catalogs using the optimal parameter sets 
determined in CIO and K09 and have found that in both 
cases the high-richness completeness is substantially im- 
proved, but this is at the expense of the purity. In any 
event, we note that the ^Pq and ^Cq statistics are quite 
high for our high-richness halos and groups, so it is not 
correct to say that they are mostly failures. Instead, the 
membership assignment appears to be somewhat prob- 
lematic for these systems. 

There are a few possible explanations for the differ- 
ences between our high-richness statistics and the oth- 
ers. In the case of K09, it may be possible to partially 
or fully account for the difference by the fact that zCOS- 
MOS has a higher sampling density than DEEP2. How- 
ever, the difference is quite puzzling in the case of CIO, 
since the VVDS sampling rate is substantially lower than 
DEEP2's. Another difference that distinguishes both the 
CIO and the K09 catalogs from ours is their redshift 
range covered and the resulting luminosity range probed. 
The main DEEP2 sam ple at z > 0.75 only probes galax- 
ies with L>L* (Willmer et al.|[20061 . so a DEEP2 group 
with five members, say, has five bright members and thus 
is quite massive. With similar flux limits to DEEP2, but 
lower redshift ranges, VVDS and zCOSMOS wfll probe 
faint dwarf galaxies at low redshift, and only brighter 
galaxies at high redshift, so that systems with five mem- 
bers will be much less massive at z = 0.1 than at z = 0.9. 
The presence of the fainter galaxies in lower-mass sys- 
tems means that the typical effective sampling density of 
groups win be higher in VVDS and zCOSMOS than it is 
in DEEP2 (recall that we exclude faint galaxies from the 
group-detection step at low redshift in the EGS). This 
may make the detection of high-richness groups easier in 
the lower z samples. 

A final potential explanation for the differences in re- 
ported purity and completeness statistics is the differ- 
ences between the mock catalogs used to estimate these 
values. As we showed in Section [5. 1[ small differences in 
the background cosmology of a mock catalog can lead to 
significant differences in the measured purity and com- 
pleteness statistics for the VDM groupfinder. CIO and 
K09 both utilized mock catalogs based on the Millennium 
simulation, whose cosmological parameters (particularly 
the value erg = 0.9) lie well outside the range preferred by 
current data. In the case of our mock catalogs, a higher 
value of CTg leads to depressed purity and completeness 
statistics, but since the Millennium mocks used a dif- 
ferent prescription for adding galaxies than we do, it is 
difficult to intuit the effect that the background cosmol- 
ogy will have on the inferred success and failure statis- 
tics. Nevertheless, since properties of the mock galaxies 
(e.g., the luminosity function) match observations, while 
the halo population is not consistent with the favored 
cosmology, the halo occupation statistics of galaxies are 
likely to be inaccurate. One must therefore interpret 



purity and completeness statistics computed from these 
mocks with some caution. 

In this study, we have made a substantial effort to 
use mocks (the Bolshoi mocks) whose galaxy population 
and background cosmology are as consistent as possible 
with a wide range of observations. We have also taken 
pains to ensure that we are striking the best possible bal- 
ance between completeness and purity given the stated 
goals of our group-finding efforts. Although the CIO and 
K09 catalogs nominally appear to have better purity and 
completeness than ours (at high richness and under very 
stringent definitions of purity and completeness), given 
the differences between our mock catalogs and theirs, it 
is nevertheless difficult to say whether this reflects a true 
difference among the datasets or an insufficiently realistic 
set of mocks in the earlier studies. 

7. CONCLUSION 

We have constructed and released to the public a cat- 
alog of galaxy groups detected in redshift space from 
the spectroscopic galaxy sample in the DEEP2 Galaxy 
Redshift Survey, including the Extended Groth Strip. 
Groups are d etected using the Vo ronoi-Delaunay Method 
group finder (jMarinoni et al. '2002'). as optimized on new, 
highly realistic mock galaxy catalogs, using similar tech- 
niques to the ones we developed in G05 for early DEEP2 
data, using a previous generation of DEEP2 mocks. The 
catalogs developed here replace those presented in G05. 
Because galaxies in the EGS region are selected with a 
different algorithm than used in the rest of DEEP2, we 
perform a separate optimization of the group finder on 
mock catalogs specially constructed to resemble the EGS. 
We also chose to exclude faint, low-redshift galaxies from 
the first steps of group-finding in the EGS, to ensure that 
the selection of the EGS group catalog is approximately 
uniform in redshift and similar to the rest of DEEP2. 

In the course of performing the optimizations, we de- 
veloped a generalized definition of the purity and com- 
pleteness of the group catalog, which avoids some prob- 
lems that arise under the definitions we used in G05. We 
argue in particular that the G05 definition is overly strin- 
gent and, by construction, gives results that are strongly 
(and spuriously) richness dependent for low-richness sys- 
tems. Using our more general purity and completeness 
measures, we find that our optimized group catalog is 
72% complete and 61% pure for the main DEEP2 sam- 
ple, and 74% complete and 67% pure for the EGS. We 
also construct a group catalog using parameters chosen 
to maximize purity (at the expense of completeness), and 
we make note of these systems in our main catalog. The 
high-purity subsample is 71% pure (76% in the EGS). We 
also estimate our success in partitioning the galaxy cat- 
alog into group and field subsamples. We find that our 
group-finder correctly identifies 70% of true group galax- 
ies, with 46% of the identified group galaxies being inter- 
lopers from the field, while 4% of identified field galaxies 
are actually members of groups (these numbers are 66%, 
43% and 6%, respectively, in the EGS). A more com- 
plete listing of group-finding success and failure statis- 
tics, including different definitions of the completeness 
and purity, is given in Table [31 

To optimize the group-finding parameters, we made 
use of mock catalogs constructed using three different N- 
body simulations, with varying values of the cosmolog- 
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ical parameters flM and as- The purity and complete- 
ness values we estimate from these catalogs vary quite 
strongly with the assumed cosmology. This underscores 
the importance of opimizing the group finder on mocks 
that are consistent with as broad a range of data as pos- 
sible, including both cosmological information and the 
properties of the galaxy population. We have taken care 
in this work to report statistics from mock catalogs that 
match the galaxy data and are consistent with all cur- 
rent cosmological constraints. We note that earlier high- 
redshift group-finding efforts (e.g., G05, K09, CIO) have 
been optimized on mocks with now-disfavored cosmolo- 
gies, so their reported purity and completeness statistics 
should be treated with caution. 

As we did in G05, we find that the optimized VDM 
group-finding algorithm can reconstruct the distribu- 
tion of groups in velocity dispersion and redshift for the 
main DEEP2 sample, above a threshold in dispersion of 
CTv ~ 300km s~^, to an accuracy better than the sam- 
ple variance in this distribution. Moreover, our optimal 
parameter set can achieve this level of accuracy for all 
three mock cosmologies considered here, in spite of the 
different purity and completeness statistics obtained fo r 
the different mocks. As shown in iNewman et al.l (|2002l ). 
the velocity function of groups can be used as a probe 
of cosmological paramters; the fact that VDM can ac- 
curately reconstruct this distribution for a range of cos- 
mologies makes the DEEP2 group catalog presented here 
a promising dataset for this test, although a more de- 
tailed accounting for sources of incompleteness and con- 
tamination in the group catalog will be important in any 
such analysis. 



Table 4 

The DEEP2 VDM group catalog (A sample is shown here. The 

full table is available in electronic form in the electronic edition of 

the Journal or from the DEEP2 DR4 web page.) 



ID« 


R.A.b.<= 


Decl.b.c 


z^ 


N 


<x„d 


H.-P.<= 


2052 


02 3122.39 


+00 3519.6 


0.92 


6 


878 


Yes 


2450 


02 26 59.36 


+00 47 31.0 


1.01 


3 


873 


Yes 


2328 


02 30 33.57 


+00 30 34.2 


1.04 


3 


873 


Yes 


2151 


02 30 26.08 


+00 38 04.5 


0.87 


4 


819 


Yes 


2147 


02 27 59.14 


+00 38 47.4 


0.93 


5 


813 


No 


1912 


23 30 36.04 


+00 04 05.2 


0.96 


3 


930 


No 


1835 


23 32 26.54 


+00 08 05.0 


1.21 


3 


894 


Yes 


1764 


23 29 41.23 


+0015 02.8 


0.83 


3 


712 


No 


2018 


23 25 50.32 


+00 20 03.0 


1.00 


2 


708 


No 


1908 


23 3315.36 


-00 02 34.4 


0.89 


2 


685 


Yes 


1416 


16 49 52.53 


+35 05 53.6 


1.12 


3 


960 


Yes 


1172 


16 5115.36 


+34 5319.0 


0.82 


3 


813 


Yes 


1175 


16 53 09.44 


+35 0017.5 


0.86 


4 


808 


Yes 


1186 


16 48 08.06 


+34 56 08.6 


0.77 


3 


807 


Yes 


1391 


16 52 50.72 


+34 52 24.2 


1.27 


3 


667 


Yes 


998 


1418 29.23 


+52 50 40.8 


0.73 


3 


902 


No 


1078 


1415 20.47 


+5219 29.4 


0.91 


4 


889 


No 


334 


14 22 02.89 


+53 25 06.2 


0.26 


3 


853 


Yes 


857 


14 23 40.21 


+53 3159.3 


0.58 


3 


847 


Yes 


142 


1415 30.90 


+5216 37.4 


0.46 


5 


845 


Yes 



'^ Unique identification number for each group. 

^ Units of right ascension are hours, minutes, and seconds, and 

units of declination are degrees, arcminutes, and arcseconds. 

All values are given for J2000.0 Equinox. The Extended Groth 

Strip is at 14 hours right ascension. 

'^ Median value of all galaxies in the group. 

'^ Given in km s~^. 

° Groups detected with both the standard and the high-purity 

parameters are indicated as High-Purity. 
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1166 


22019550 


1166 


22027206 


1167 
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1010 
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11019555 


1012 
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number for each group, 
as in Table m 
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APPENDIX 

ANALYTICAL TECHNIQUE FOR PREDICTING THE VELOCITY FUNCTION OF GROUPS 

One of the primary stated purposes of this study is to create a catalog of massive systems that can be used to constrain 
cosmological parameters by measuring the observed distribution of groups in redshift and velocity dispersion. As a 
service to future users of the catalog, we outline here a method for computing this distribution as a function of 
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cosmological parameters. All of the calculations outlined below can be straightforwardly performed to include redshift 
dependence; we do not explicitly include this for succinctness. 

The goal is to constrain the mass function of dark-matter halos, whose dependence on cosmology is well understood 
theoretically, using the measured velocity dispersion of groups as a proxy for halo mass. Doing this requires an 
analytical technique for converting from the halo mass function to the velocity function, i.e., the distribution of galaxy 
groups as a fu nction of their observe d velocity dispersion a^. When forecasting the constraints that might be possible 
from DEEP2, iNewman et al.l ()2002f) developed a method for converting from the mass function to the halo velocity 
function, as would be measured using the dark matter velocity dispersion a^^ , but that analysis did not account for 
the substantial additional scatter in the M — a^ relation arising from observational selection effects and the limited 
number of galaxies used to calculate each group's Cv (as shown in Figure [5]). 

In any case, the first step is to compute the halo mass function dnj dM and convert it to the halo velocity function 
dn/da^^. The theoretical halo mass function has a relatively simple dependence on cosmological parameters and 
has been calibrated against N-body simulations to an accuracy better than required for DEEP2 comparisons (e.g., 
[Tinker et al.l 120081 and references therein). The relation between halo mass M and cr^^ is also well calibrated from 
N-body simulations (jEvrard et al.l 120071 1 . so we can use this relation to convert from dn/dM to dn/da^^ directlyo 
Then the observed velocity function takes the form of a convolution to account for observational scatter: 



dn/da: 



gal _ 



dn/da^'"P{c7^^'\a^'")da. 



gall DM ^, 



,DM 



(Al) 



where a^ is the velocity dispersion measured using the observed galaxies in the group. 

The remaining task is then to compute the conditional probability distribution function (PDF) in Equation lAll 
Provided that the number of galaxies N per halo is known, and assuming that these galaxies randomly sample the 
dark matter velocity field (i.e., that there is no velocity bias) and that the underlying velocity distribution is normal, 
the problem reduces to computing the distribution of measured dispersions obtained by sampling a Gaussian N times. 
The form of this distribution depends on the estimator we us e to conipute the measured dispersion. For the usual 
standard deviation estimator, it can be computed analytically (jKennevlll951h : 



PsdicT^^ 



gali^DM 



iV)cx 



1 



rDM 



T-gal 

.DM 



N-2 



exp 



N 



T-gal 

-DM 



(A2) 



with the normalization being set by the condition /^ P da^^^ — 1. 

Our preferred estimator, however, is the gapper. Equation [TJ The sorting of the data points required by that 
estimator makes analytical computation of the desired conditional PDF intractable for iV > 2. Howe ver, for the N — 2 
case, we can derive a closed form, via a calculation very similar to the one that produced Equation IA2I 



Pg{(t: 



gal|^DM 



,iV = 2)cx 



rDM 



exp 



T-gal 
^ V 

tDM 



(A3) 



The notable similarity between the N — 2 cases of Pg and Psd suggests a possible approximate form for Pg in the 
case of general N, 



Pg{^ 



gal|^DM 



.N) 



1 



tDM 



T-gal 

V 

,.DM 



N-2 



exp 



a{N) 



T-gal 
^ V 

tDM 



(A4) 



where a{N) is a free parameter that needs to be determined by numerical calulations. Monte-Carlo experiments 
indicate that the one-parameter fitting formula. Equation I A41 is a remarkably good model for the trf^' PDF. By fitting 
this function to Monte-Carlo simulated distributions over the range 2 < N < 25, we find that the parameter a{N) is 
well described by 

aiN) = N-2[1--] 1 + . (A5) 
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Armed with the above formula for PG{a^ {crl; , N), we can write down a form for the desired conditional PDF: 



P(afVD 



■all DM 



JVobB=2 



[p(7Vobskv )p(arkr\^obs)] 



(A6) 



where TVobs is the number of observed galaxies in a particular halo. The lower limit of the sum obtains because systems 
with A'obs < 2 do not qualify as groups; we note that this sum thus also implicitly encodes the halo selection function 
in full detail, provided that we have an accurate form for P(A'obs|o'^'^)- 



^^ In principle there is some scatter about the M—a^^ relation, 
so the conversion should actually take the form of a convolution 
of the mass function with the distribution of crj?'^ at fixed M, but 
in practice this scatter is much smaller than the observationally 



induced scatter we discuss below, so it can be neglected in favor of 
a simple change of variables. 
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Figure 10. On-sky density of VDM galaxy groups in tiie DEEP2 survey and in the mocks, using the same redshift and velocity-dispersion 
bins that we used in Figure |4] DEEP2 points show the density of groups in the three high-redshift DEEP2 fields, and error bars show 
Poisson uncertainties on these measurements. Mock data points show the mean counts from 20 one-square-degree mock lightcones. Mock 
error bars show the estimated DEEP2 sample variance, which is computed as the standard deviation of the values from the 20 mocks 
reduced by a factor of 0.65 to account for the three independent DEEP2 fields, two of which are only partially observed. 



What remains, then, is to determine P{NohsW^^)- This can be done in principle using sufficiently realistic mock 
catalogs to measure the desired PDF directly. Since this is likely to be noisy, however, it may be preferable to use the 
techniques of the halo model: one can simply measure the ob served HOD i Vnh.s(M ) and assume a simple well-motivated 
model for the scatter about this mean, such as the model of iZheng et al.l (|2005[ ). who use Poisson scatter for satellites 
and a nearest-integer distribution for centrals. We note that, since this is the HOD of observed galaxies, including 
all spectroscopic selection effects, the form of the HOD will be different from the one that is usually assumed — for 
example, the mean number of centrals will always be less than unity — but the scatter model should still be applicable. 

Thus, given sufficiently realistic mock catalogs, it is possible to predict the observed group velocity function analyt- 
ically. The question of what constitutes sufficient realism in the mocks warrants brief discussion, though. Since we 
are using the mocks to compute the observed HOD, we would like them to accurately (a) reflect the true total HOD 
N{M) in the Universe and (b) include all selection effects relevant to the dataset being used. The first requirement 
is equivalent to demanding that the mocks reproduce the observed two-point correlation function of galaxies, and the 
second necessitates careful modeling of galaxy properties to match the data. As discussed above and in Gerke et al. 
(in prep.), the mock catalogs we use here meet the latter requirement but not the former. However, as also discussed 
in Gerke et al. (in prep.), the improvements to the mock catalogs that are likely to improve the clustering agreement 
are also likely only to impact the low-mass portion of the HOD, where iV < 1. Since the above formalism only relies on 
objects with A'obs > 2 , there will be little contribution from this inaccurate portion of the HOD. Thus, it is possible 
that the mocks we used in this study are sufficiently accurate to carry out the modeling described here; however, this 
assumption would need to be tested in some detail before robust cosmological constraints could be derived from the 
velocity function of DEEP2 groups. 

A simple way to test this within the confines of the present study is to compare the abundance of DEEP2 groups in 
bins of (Tv to the abundance of groups detected by VDM in the mocks. If we assume that the mocks are a sufficiently 
accurate representation of the galaxy population in groups, then this test constitutes a consistency check between 
DEEP2 and the background ACDM cosmology of the Bolshoi mocks (which is very close to the current best fit to 
the available data). Figure [TOl shows this comparison. Data points show the on-sky density of groups detected with 
VDM in DEEP2 and in the mocks, for groups with a^ > 350 km s~^, using the same binning in redshift and Cv 
that we did in Figure |4l The DEEP2 points are measured from the three high-redshift DEEP2 fields, and the error 
bars are Poisson uncertainties. The mock data points and error bars show the mean and standard deviation in the 
counts from 20 different mock lightcones, with the error bars reduced by a factor of 0.65 to account for the reduction 
in sample variance expected by combining the three independent high-redshift DEEP2 fields, given that two of them 
were only partially observed. We computed this factor using the sample-variance estimation techniques developed in 
iNewman fc Davisi feOOO) . 

The mock and DEEP2 group abundances are in good statistical agreement at z < 1, suggesting that DEEP2 is in 
good agreement with the Bolshoi background cosmology and mock galaxy modeling in this regime. In the highest 
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redshift bin, all three DEEP2 measurements lie below the mocks at roughly the 2cr level. It is important to note that 
the three DEEP2 data points at each redshift are strongly correlated with one another, since the entire halo mass 
function will be scaled up or down in overdense or underdense regions of the universe. Hence, the similar disagreement 
for all three high-redshift CTv bins does not necessarily indicate a systematic difference. We can conceive of three 
possible explanations for this discrepancy, which we list in decreasing order of plausibility: (1) the mock catalogs may 
not capture the DEEP2 galaxy population at z > 1 sufficiently accurately to model the group selection function, (2) 
the DEEP2 survey may probe a volume of the universe that is underdense at the ~ 2a level at 2; > 1, or (3) the 
growth of cosmic structure in the Bolshoi mocks is inconsistent with the real universe, indicating a problem with the 
current concordance cosmology. Construction of a DEEP2 mock catalog that more accurately reproduces the measured 
two-point function would help to settle this question. 



